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Introduction 


When we were solving sets of simultaneous equations like 
3X, — xX, + 4x3 =6, 


we found we were particularly interested in the rectangular blocks of numbers 


2 3 > —4 s 
1 2 1 and 3 
3 —1 4 6 


These rectangular blocks are called matrices (plural of matrix). There are other 
situations where such blocks of numbers occur. For instance, look at the 
information in Table 1 below showing the prices of different articles in competing 


shops. (Prices are given in pence.) 
Dessert Dessert 
spoon fork 


Table 1 
ia oo oe 
The numerical information in this table is contained in the matrix 


You will meet a matrix like this in Section 2. 
Another example of a simple matrix can be obtained from the vector 
r=d,i+ a,j + a3k. 


The essential information about the magnitude and direction of r is contained in 
the three numbers a,, a, and a,. So it is quite usual to represent r by a column 
ay 
matrix |d, |, or sometimes by a row matrix [a, a, a3]. 
a, | 


It is because vectors are often represented by such column (or row) matrices, that 
such matrices are in fact usually referred to as column vectors (or row vectors). 
These terms are extended to refer to any matrix consisting of a single column (or 
row) of any length. For example, [1 25 4 7] is referred to as a row 

vector. 


Manipulation of matrices like the ones above occurs sufficiently often for it to be 
worth while to develop an algebra of matrices which will enable us to refer simply 
and briefly to the manipulations we carry out. 


You saw in the unit on complex numbers how it was possible to create an algebra 
for things other than real numbers, with suitable definitions for ‘+’ and ‘x’. We 
can manipulate complex numbers in the same way as real numbers, simply 
because they obey the same laws. When we have defined what we mean by matrix 
‘addition’ and ‘multiplication’, we will find that the rules for matrix algebra are 
largely similar to those for real and complex numbers. The main purpose of this 
unit is to investigate these and to learn how to use the resulting algebra. For 
comparison, the main laws for real numbers are given in the Handbook. 
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Study guide 


You must make sure that you understand the contents of Sections 1 and 2 and 
Subsection 3.1 (the tape subsection) before you watch the television programme. 
This is probably about three hours’ work. 


If you have studied matrices before, you should find this a reasonably easy unit. It 
is for this reason that there are not too many matrix multiplication examples 
included in the body of the text. However, if you have never met matrix 
multiplication before, you will probably find that you need to do some more 
examples, as it takes a bit of time to get facility with this operation. It is for this 
reason that there are plenty of multiplication examples at the end of Section 2. In 
general, the end of section exercises should be used, either as extra practice if you 
need it, or for revision purposes. The end of unit test is there for you to see if you 
have assimilated the contents of the unit! 


It is expected that you will study Sections 1 and 2 in the first working session. 
This will be a bit heavy if you haven’t studied matrices before, but some of the 
later sections will take much less time. | 


Finally, Section 5 is about determinants. Determinants are not matrices, but to 
study this section you need to know the definition of a matrix. 


1 Some simple matrix operations 


1.1 Matrix notation 


You will remember from Unit 9 on simultaneous equations that a matrix is a 
rectangular block of numbers. When we wanted to refer to a general set of two 
equations in two unknowns, we wrote 


aX, +),x%, =, 
GX; + )5X5 = C5. 


and when we wished to use the matrix of the left-hand side coefficients we wrote 


This notation is fine when discussing small sets of equations or small matrices, but 
we would run out of letters of the alphabet if we wished to talk about larger 
matrices. It is for this reason that we adopt the notation 


441X, + ay2x, =), 
Ay 1X1 + Ay2X_ = dy. 
This very convenient notation means that we never run out of names for the 


coefficients, and can cope with any number of equations or any size matrix in this 
way. 


It is usual practice to use a capital letter when referring to matrices. In this course, 
we shall use a capital letter in bold type. Thus the matrix 


will be referred to briefly as A. Other matrices could be called X or B, for example. 
The only exception to this notation is that used for matrices consisting of a single 
column. It is common practice to refer to these ‘column vectors’ using a lower-case 
letter. We shall use a lower-case letter in bold type for these. Thus the column 
vector 


will be referred to briefly as x. 
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Now, when we use the letter A to represent a matrix, it tells us nothing about its 
size or its contents. Suppose we wish to refer to a matrix which has two rows and 
three columns. We call this a two by three matrix and write it as ‘2 x 3’. 


Note: it is important to specify the number of rows before the number of columns. 


Exercise 1 

Write down examples of matrices of the following size. 
in * 2 (ii) 3x 4 (11) 4 x 1 ivy d <2 
[Solution on p. 55] 


Each individual entry in a matrix is called an element. The element q;; iN a matrix 
Q refers to the element in the ith row and jth column. 


Example 1 
The elements of a 2 x 4 matrix Q are labelled in the following way: 


q=| % G12 913 vee 

G21 G22 G23 G24 

In this way, each individual element in a given matrix has its own name, and we 
know where to find it. 


Exercise 2 
How would we refer to the element in the fourth row and fifth column of a matrix P? 
[Solution on p. 55] 


In general, a matrix A with m rows and n columns is called an 
m X n matrix: 


Amt m2 ss Amn 
In this matrix, a,, is the element in the ith row and jth column. 


In the case of a column vector x, it is only necessary to specify one subscript. For 
instance, we know that x, is the element in the third row of a column vector x. 


1.2 Equality of matrices 


We define ‘A = B’ to mean that A and B are identical, i.e. 
(a) that the matrices A and B are the same size, 


(b) that the corresponding elements are equal. 


So, if a-{o a Se and B=| ( : | 


then A = B means that 


) 
a,, = 1, ay, = 2, ain = 1, 


In this instance, the single matrix equation provides us with a shorthand notation 
for writing down six real-number equations. 
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Exercise 3 
Write down the real-number equations implied by the statement A = B, if: 


- ax + by rs 
) A= = : 
(11 | ay and B A 


Solution on p. 55] 


1.3. Matrix addition 


You know from Unit 14 on vector algebra, that the sum of the two vectors 
a=d,i+a,j+a,k and b = b,i+ b,j + bk 


bm 


a+b=(a, + 5,)it+ (a, + 5,)j + (a, + 53)k. 
In our matrix notation, this would be written as 


a, b, a, +5, 
a,|} +|b,| =|a,4+5),}. 
a, b, a,+b, 


So, if this matrix representation of vectors is to be at all useful, the definition of 
matrix addition would have to include this statement above. A fairly intuitive 
extension of this gives us the following definition. 


If A and B are two matrices of the same size, we define 


A+B 


to mean the single matrix formed by adding the corresponding 
elements of A and B. 


Example 2 
2 3 1 0 2+ 1 3+0 
1 4;+10 4\ = 1+0 4+4 
—] ‘ 1 3 —1+1 5+ 3 
3 3 
ae 8 
0 8 
However, if 


A 1 2 d B= : 


we would not be able to add these matrices as, by definition, matrices must be the 
same size before we can add them. 


Another example of matrix addition was seen in the Gaussian elimination method 
in Unit 9. Each row of the matrix we used in the elimination process can itself be 
considered as a matrix. We were quite cheerfully adding one row to another so as 
to obtain a new row during each stage of the elimination process. 


Exercise 4 
Given that 
2 1 —] 4 0 p: 3 
Aa 4 3 : B= 1 1 0}, C=} : 
1 5 0 1 1 6 3 


a 


find (where possible): 
(i) A+B, (ii) A+ C, (ii) C + x, (iv) (x + y)+z. 
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Exercise 5 


Write down the real-number equations implied by x + y = z, where x, y and z are the 
matrices defined in Exercise 4. 


Exercise 6 

A matrix C is formed by adding two m x n matrices A and B. 
(1) What size will C be? 

(ul) How do we form the element c; ;? 

[Solutions to Exercises 4-6 on p. 55] 


Before we can do any algebra with our matrices, we must check whether the rules 
for addition which operate for real numbers are true for matrices. In the following 
proofs we shall assume that the matrices are of the same size. 


hs. prove the commutative law for matrices for addition: 
A+B=B+4A. 


Proof 


A typical element in the ith row and = column of A + B is a,, + b,;. The 
equivalent element for B + A is bj + aj; 


So, as we know from the commutative law for real numbers that 
a;; + b;; = b;; + a;;, it follows that 


A+B=B+A. 
2. To prove the associative law for matrices for addition: 
A+(B+C)=(A+B)+C. 


Proof 


A ‘typical’ element of A + (B + C) is a;; + (b;; + c,;). The equivalent element in 
(A + B) + C is (a,, + b,,) + ¢,;. Now, from the associative law for real 
numbers, we know that 


a;; + (b;; + ¢,;) = (a; + 5;; oF ee. 
Hence we derive the associative rule for matrices: 
A+(B+C)=(A+B)+C. 
Having proved these two laws for matrices, we can now use them when required. 


It is useful at this stage to define the zero matrix 0, and the negative of a matrix A. 
These are analogous to the number zero, and the negative of a number a, in our 
ordinary number system. 


We define a zero matrix as a matrix consisting entirely of zeros. 
Such a matrix (whatever its size) is denoted by 0, and has the 


property that 
A+0=A. 


The size of a zero matrix is fixed by the context in which it arises. For instance, if 
A above were an m x n matrix, then the ‘0’ would also be an m x n matrix. 


_A matrix —A is one whose elements are the negatives of those of 
A. It has the property that — 


A+(=A)=0 


For instance, ifA = [1 —2], then —A = [-1 Zt 


Whenever we write A — B, this will be taken to mean A + (—B). In practice we 
just subtract the corresponding elements of B from A. 
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Exercise 7 
—2 -1 ; 
Given that A = 4 = write down the matrix —A, and the zero matrix such that 
A+(-A)=0. 
Exercise 8 
3 2 2 eee = 
IfA=|1 2; and B= |-1 li,fndA-—B. . 
a 2 -3 | 


Solutions to Exercises 7 and 8 on p. 55] 


1.4 Multiplication of a matrix by a number 
The effect of performing the operation 
A+A 


is to obtain a matrix whose elements are double those of A. Similarly, if we add 
A +A-+ A, we obtain a matrix whose elements are three times those of A. It is 
tcuaptine to think of A+ A as 2A, and A+ A+ A as 3A, and we extend this idea 
to make the following definition. 


If k is a real number, and A is any matrix, then we define 


_kA (or Ak) 


to be a matrix (which is the same size as A) in which each 
element is the corresponding element of A multiplied by k. 


It should be noted from this definition that 
(-—1)A=—-—A and 0A = 0. 


Example 3 
2 1 3 10 5 15 
Wa=| 7 5 of then a=|7 10 2 


With this definition of multiplication of a matrix by a number, it can be shown 
that: 


(i) (k,k,)A =k,(k,A), 

(ii) k(A + B)=kA + kB, 

(ii) (k, +k,)A=k,A+k,A. 

These laws, together with the laws for matrix addition, mean that, provided our 
matrices are of the same size and we confine ourselves to the algebra in this 


section, we can manipulate matrix algebra in much the same way as real number 
algebra. 


Exercise 9 
Z 1 —1] 0 

Given k = 5, m= 2, A = | 3 2| and B= 1 0 |, write as a single matrix the 
4 1 1 -1 

following expressions: 

(i) kA, 

(ii) k(A + mB), 

(iii) kA — mB, 


(iv) mA +mB-+ kA — KB. 
[Solution on p. 55] 


Example 4 
We can find the matrix X from the matrix equation 


MAT 3ASs A A 
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by using the laws and rules of this section. The references are to the list of rules 
given in the summary of this section. 


2A+6X—-A=B+A (Summary 3(iv) then (iii)) 
6X +2ZA—A=B+A (Summary 3(i)) 


6X +A =B+A. (Summary 3(v)) 
Adding —A to both sides of the equation, we get 
6X = B (Summary 3(v)) 


and dividing both sides by 6, we get 
X = ZB. (Summary 3(iii)) 
In fact, as you can see, this solution involves only the operations discussed in this 


section. We work with this equation in exactly the same way as we would with an 
equation involving numbers. 


Exercise 10 
(i) Simplify the expression 
2(A + 3B) — 4(A — B). 
(ii) Solve the following equation for X: 
X + 3(A + X) = 2X — A. 
[Solution on p. 55] 


Summary of Section 1 


1. Notation 


(i) A matrix is represented by a capital letter in bold type, ca AE... 

(ii) Column vectors are represented by lower case letters in bold type, e.g. a,b,... 
(iii) A matrix.A with m rows and n columns is called an m x n matrix. 

(iv) a,,; is the element in the ith row and jth column of a matrix A. 


2. Algebra 


(i) Equality: A = B means that A and B are the same size, and their 
corresponding elements are equal. 


(11) Matrix addition: if two matrices A and B are the same size, then 
A+B 
is the single matrix formed by adding the corresponding elements of A and B. 
(iii) kA is the matrix whose elements are k times those of A. 
Two particular matrices are defined: 
(iv) a matrix 0 (consisting entirely of zeros) with the property thatA +0=A; 
(v) amatrix —A whose elements are the negatives of those of A. It has the 
property that A + (—A)=0. 
3. Laws 
Given matrices A, B, C (all the same size), then: 
i) A+B=B+4A, aes 
ii) A+(B+C)=(A+B)+¢G 
(ui) (k,k,)A = k, (kA), 
(iv) k(A +B)=kA + kB, 
(v) (ky, +k,A=k,A+Kk,A. 


In this way, we start to build up an algebra for matrices very similar to the 
algebra for real numbers. 
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End of section exercise 
Exercise 11 

eee 
re 

a 


s-|' 2 Pp 2 a 42 
= ae To fs 
(i) Find, if possible, 


(a) A+ B, (b) B—C, (c) C+ 2x (d) x + 2y, (ce) A— 2B+C. 


(ii) Find the matrix P if 
2(P + 2A) + B = 3C. 
(Solution on p. 55] 


2 Matrix multiplication 


2.1 Motivation 
Example 1 


milk bread sugar 
(pints) (loaves) (kg) A B 


Table 2 


Table 1 


Table 1 above shows four people’s weekly milk, bread and sugar requirements, and 
Table 2 shows the cost (in pence) of these items in two supermarkets A and B. 
There is another table which can be constructed from these two tables—a table 
which shows how much the four people will have to pay for their requirements in 
either supermarket. 


Table 3 


To work out how much it would cost John to shop in A, he buys 


6 pints of milk at 18p each, 
3 loaves at 30p each, and 
2kg sugar at 35p each, 


giving a total cost of 


(6 x 18) + (3 x 30) + (2 x 35) 
= 268p. 


12 --MST204 20.2 


To work out how much it would cost Joyce to shop in B, we take Joyce’s row in 
Table 1 (her requirements), and the ‘B’ column in Table 2 (the cost of these 
requirements), and obtain the cost 


(7 x 17) +)(2 x 32) 4+ (i-« 33) 
= 216p. 
Exercise 1 


Fill in the rest of the entries in Table 3. 
[Solution on p. 56] 


This combination of the two tables could be expressed in matrix terms: 


3.23 : ee, 268 264 
110 2 2| combined with | 30 32| gives | 310 300 

ee ee a5" 33 1 ee 

i 43 495 482 


This process, of combining rows of the left-hand matrix with columns of the right- 
hand matrix to produce elements of a new matrix, is the operation which we call 
matrix multiplication. This is probably the operation which we perform most 
frequently when using matrices. You will see some examples of its use in this unit. 


5 
SS 5 
ad: Gs ay 1 b5! = You will notice the analogy 
Looe | z - with the formula for 
53) t calculating scalar products of 
a,b, + anb, + a3b; vectors. 


2.2 The definition of matrix multiplication 
When we write 
C = AB, 


we mean we are combining (multiplying) two matrices A and B to obtain a new 
matrix C. Each element of this matrix is formed by taking a row of the left-hand 
matrix A, and combining it with a column of the right-hand matrix B, along the 
lines indicated in the previous section. The following example should show how 

the individual elements of such a matrix C are formed. 


Example 2 
Find AB if 
6 -1 
A= ij § 4 and b= 0 
—1 1 
Solution 
Let 


aB=c =|“ le 


C21 C22 

where C,, 1s formed by taking row 1 of A and combining it with column 1 of B, 
giving 

Cy, =(2 x 6)4+ (1 x 0) + (3 x -1)=9. 
To get c,,, combine row 1 of A and column 2 of B, to give 

C12 =(2 x —1)+ (1 x 2)4+ (3 x 1) =3. 
To get c,,, combine row 2 of A and column 1 of B, to give 

Cz, = (4 x 6) + (5 x 0) 4+ (6 x —1) = 18. 
To get c,,, combine row 2 of A and column 2 of B, to give 


Q,= 4x —1 +6246 x p= 
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Thus we obtain 


pavement | 
& NN 
nn -_ 
ON W 
SRE RLS, 
| 
—_= CO ON 
| 
—= RQ — 
| 
Riyacoy 
permeors 
CO WO 
— 
N WwW 
L scitinwcauel 


So, to find an element c,, of a matrix C = AB, take row i of the left-hand matrix A, 
and combine it with column j of the right-hand matrix B in the manner described 
above. 


Exercise 2 


Find AB if 


(Solution on p. 56] 


There are more exercises on matrix multiplication later in this section. Make 
certain that you do enough of them to feel confident that you can do the process. 


The method above actually determines the size of C = AB. For instance in 
Example 2, to obtain an element c,,, we would have had to combine row 3 of A 
with row 1 of B. As A has no third row, neither has C. So we conclude that C has 
the same number of rows as the left-hand matrix A. Similarly, to obtain an element 
C,3, we would have to combine row 1 of A with column 3 of B. As B has no third 
column, neither has C. So we conclude that C has the same number of columns as 
the right-hand matrix B. 


Example 3 


If A is a 2 x 3 matrix, and B is a 3 x 4 matrix, then AB will be a 2 x 4 matrix. 
For instance, 


Exercise 3 
Find the size of AB, if A is a 3 x 5 matrix, and B is a 5 x 2 matrix. 
(Solution on p. 56] 


Finally, the rule for combining the elements of A and B has automatically imposed 
a restriction on the size of matrices we can multiply together. For instance, if a 

| b, 
typical row of A were [a, a, a3 4,] anda typical column of B were | 5, |, 

3 b, 

then we wouldn’t be able to do the specified row-column operation, as we would 
have an element a, left over. So we have the natural restriction, that to be able to 
form AB, the number of columns of A must equal the number of rows of B. 


Example 4 


If A is a 2 x 3 matrix, then if AB is to be formed, B will have to have 3 rows. For 
instance, the products 
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can all be formed, whereas to form the product 
eee See) e - 1 
ee ee poe 


is not possible. 


(Note, there is no restriction on the number of columns that B may have, or the 
number of rows that A may have.) 


Exercise 4 

If A is a 2 x 4 matrix, then what size must B be to be able to form 
(i) AB (11) BA? 

[Solution on p. 56] 


The restrictions on the sizes can be summed up as follows. 


Exercise 5 
Given the matrices 


(i) state which matrix products can be formed, 
(il) state the size of the resulting matrix products. 


Exercise 6_ 
Form all the possible matrix products from the matrices 


oe a = 
sa? 9 oe ee 


Exercise 7 
Given the matrices 


2 0 1 1 2 —1 
A= {—1] 2 1 and B= |-1 2 a 
3 4 -2 3 4 1 


find the matrix products AB and BA. 
[Solution to Exercises 5—7 on p. 56] 


The ideas of this subsection can be summarized in the following definition of 
matrix multiplication. 


The matrix C = AB can be formed if the number of columns of 
the left-hand matrix A equals the number of rows of the right- 
hand matrix B. The matrix C will have the same number of rows 
as A, and the same number of columns as B, and its elements are 
given by | 


Cig = 4j,0,, + j2b., + -*> + a,b 


in’ nj 
n 
= 3 Find; 
k=1 


where n is the number of columns of A. 
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2.3. The algebraic laws governing matrix multiplication 


Before we can freely use matrix multiplication, we need to know whether matrix 
multiplication is commutative and associative, and whether there is a distributive 
law. 


As you can see from the results of Exercise 7, AB and BA are not the same. Any 
counter-example would have been adequate to show that, in general, matrix 
multiplication is not commutative, that is, 


AB is not usually equal to BA. 


In fact, we could have seen this without doing an example at all. If we multiply an 
m x n matrix A to ann x p matrix B, we obtain an m x p matrix C. However, it is 
not even possible to form the product BA unless p is equal to m. 


As this is the first thing that we have discovered about matrices which differs 
significantly from real number algebra, we shall have to be very careful. 


For instance, if 
A=B (1) 


and we wish to multiply both sides of this matrix equation by another matrix C, 
then we have to decide either to multiply both sides of (1) by C on the left, to 
obtain 


CA = CB, (2) 
or we could (size permitting) multiply both sides of (1) on the right, to obtain 
AC = BC. (3) 


Both the matrix equations (2) and (3) would be logical consequences of (1), 
whereas there would be no justification in assuming that that AC and CB were 
equal. 


Despite this bad beginning to the investigation of the laws governing matrix 
multiplication, in fact the associative and distributive laws are exactly the same as 
those in real number algebra. However, as matrix multiplication is not 
commutative, there are two versions of the distributive law to prove: 


A(B+ C) = AB+ AC 

and 
(B+ C)A = BA + CA. 

The proofs of these, and of the associative law 
A(BC) = (AB)C 


are given as answers to the following exercise. They are there for completeness, but 
please don’t attempt to do them unless you are very brave, and have time on your 
hands! Exercises 16 and 17 at the end of the section will demonstrate these laws 
for three particular matrices. You are strongly advised to do these exercises now if 
you haven’t had much practice at matrix multiplication before. 


Exercise 8 (Optional) 


Assume (for simplicity) that A, B and C are all n x n square matrices. Prove, using the 
definition of matrix multiplication given in the box on p. 14, that: 


(i) A(B+C)=AB+AC, 
ii) (B+C)A =BA+ CA, 
(iii) A(BC) = (AB)C. 
[Solution on p. 56 | 


These laws, along with the laws of Section 1 and the inverse matrix in Section 4 
(the matrix equivalent of the number a™ 1) will enable us to do algebra with 


' A consequence of the 


associative law, 

A(BC) = (AB)C, is that we 
can omit the brackets where 
appropriate and write ABC 
without fear of ambiguity. 
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matrices in much the same kind of way as we do with real numbers. The only 
difference will be that we shall have to be careful not to interchange expressions 
like AB and BA. 


2.4 The product of a matrix and a column vector 


The product of a matrix and a column vector produces another column vector— 
for example, | 


Z 7 1 1 =. 

3 4 2 2) FOE, 

5 QO —1 —| 6 
Exercise 9 


rm (Ceaage: 6 
= Fe cee al ea q. 


[Solution on p. 57] 


Example 5 


3 1 x 
If A = |ndx = | then 
4 2 X>5 
ay +X, 
<= 
aX 2h, 
In this last example, the column vector formed looks very like the left-hand side of 
a pair of simultaneous equations. It can be seen that any matrix A multiplied by 
an ‘unknown’ column vector x provides us with a column vector whose elements 


are linear combinations of the elements of x. This enables us to write any set of 
linear simultaneous equations in the form 


Ax = b. 
Example 6 


1 -1 xi 0 
Given that A= E A = i b= Bl 


write down the simultaneous equations represented by 
Ax = b. 


Solution 


gives us 


aia ant Ls 

2x 42+ 3x, =e 

By definition of equality of matrices, this gives us the simultaneous equations 
: eee es, 
aX OX, = 5. 


Exercise 10 _ 
Write down the simultaneous equations represented by Ax = b 


3 Lo .=—2 x, 2 
ifA=j| 1 Z by, KS) x, |, b= Se 
4 1 0 <s — | 


[Solution on p. 57] 
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Example 7 
Express the equations 
2x, + x, =3 
x, — 3x, = 5 


in the form Ax = b. 


Solution . 

Reversing the procedure of Example 6, the left-hand side of these equations can be 

written as 
2 1 |) xy Note that A is the matrix of 
1 —3]]/ x, | the left-hand side coefficients. 


So the equations above can be written as 


Z Lijx,] 13 
1 —3]|x,] 15 
which has the required form Ax = b. 


Exercise 11 
Write the equations 


x, — 3%, + X,= 


| 
NO — 


ne + aX, + 5X3 = —| 


in the form Ax = b. 
[Solution on p. 57] 


The reason that this notation was not used in the unit on simultaneous equations 
was that the operation of matrix multiplication had not been discussed at that 
time. 


There is another context in which linear combinations and matrices arise. Suppose 
that we have two pairs of numbers (x,,y,) and (x,,y,) such that 


X5 — Xy +> sa 


(1) 
yz =4x,+ Jy 


These equations allow us to put values of x, and y, in the right-hand side of (1), 
and hence calculate the values of x, and y,. In a sense, we have ‘transformed’ 
(x,,¥,) into (x,,y,), so a set of equations, like (1) above, is called a 
transformation—in this case a linear transformation. All linear transformations 
may be written in matrix form. For instance, the Equations (1) above can be 
written as 


Eee iT; 


Thinking of transformations in this matrix form enables us to manipulate them 
more easily. For instance, suppose we know that 


X2 My + sae (1) and x3 ae 3X4 ae | (2), 
yz =4%,+ VM y3= X,+2y, 


and we wish to express x, and y, in terms of x, and y,. 


We know that (1) and (2) above can be expressed in matrix form as 


Plc bi} = [s]-[) “p] « 


Then we can replace the expression for the column vector a from (3) into (4), 
Z 
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a tier) Cem 


using the associative law for matrices. Thus x, and y, can be expressed more 
simply in terms of x, and y, as 


y 9 4iLy, | 
Rather more generally, suppose we perform a sequence of linear transformations of 
this kind: 


ae!) where r, =| -* |. 
r, = Br, (4) Yi 
Then by substituting the expression for r, in (3) into (4), we get 
r, = B(Ar,) 
Bie (BA)r,, 


and hence we obtain expressions. for x, and y, in terms of x, and y,. 
You will need this concept of combining successive matrix transformations using 
the multiplication process to understand what is happening in the television 
programme. 


Exercise 12 


=e: ee a4 1 x 
fA= B= = =| "+ 
— ; aa f 7 : 1 | gre, | 


express the following succession of matrix transformations: 
r,=Ar,, r,=Br, and r,=C, 

as a single matrix transformation 
r, = Qr,. 


Find the matrix Q, first in terms of A, B, C and then as a matrix with numerical elements. 
[Solution on p. 57] 


2.5 The matrix transpose 


The transpose of a matrix A (written as A’ in this course, or 
sometimes as A’) is simply the matrix whose columns are the 
rows of A. 


For instance, if 


then 
20 
AT=|1 4 
53 


You can see that the first row of A becomes the first column of A’ and the second 
row of A becomes the second column of A’. 


Exercise 13 
3 a 
If A = and = =B=; 0 — 3}, 
eee | eee: oS 


write down A? and B’. 


MST204 20.2 19 


Exercise 14 

Show for the matrices in Exercise 13 that 
(AB)? = B’A’. 

[Solutions to Exercises 13 and 14 on p. 57] 


The result 
(AB)? = B7A? | (5) 


is always true, although it is not proved in this unit. We shall assume this result 
and show that a similar result is true for three matrices. 


Example 8 

To show that (ABC)! = C™BTA’. 

Proof (ABC)? = (A(BC))? 
= (BC)’A? using (5) 
= C™BTAT using (5) 


This is just another example of how to use matrix algebra. 
The general result is 


(A,A,...A,)" = ATAT_,... ATZAT. 


An example of the use of a matrix transpose is in the theoretical matrix 
description of the linear programming problem in Unit 10. A typical linear 
programming problem would be: 
Minimize c,x, + CX, + €3X3 
subject to the conditions 
Ay 4X1 + 4y2X <b, 
ao4X5 + A53X, = b, 
X1,X2,X3 2 0. 
In matrix terms, this can be expressed as: 
Minimize [c, c, c3][x, 
X2 
x3 


subject to the conditions 


hicks, Xs O: 
Rather more briefly, we say: 
Minimize c’x 
subject to the conditions 
Ax <b 
and x>0. 
We are forced to use the notation c’ as, to save eantesion. we stated at the 


beginning of the unit that the notation ¢c refers only to a column vector. So if we 
need a row vector, as in this case, we use the notation c’. 


Finally, note the economy in expressing our linear programming problem in terms 
of matrices. You will find that matrix notation gives us an excellent short way to 
express and to simplify problems. 
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Summary of Section 2 


1. The matrix C = AB can be formed if the number of columns of the left-hand 
matrix A equals the number of rows of the right-hand matrix B. 


The product matrix C will have the same number of rows as A, and the same 
number of columns as B, and its elements are given by 


Cig = 4;40,; + Gigb2, + +++ + indy 
n 
= 2 Aid; 
k=1 


where n is the number of columns of A. 


2. The commutative law for matrix multiplication is not true. So, in general, AB 
is not usually equal to BA. However, the multiplicative associative law 


A(BC) = (AB)C 
and the distributive laws 

A(B + C) = AB+ AC 
and 

(B+ C)A=BA+CA 
are true. 


3. The product of a matrix and a column vector is another column vector. This 
enables us to express: 


(1) simultaneous equations in the form 
Ax =b; 
(ii) linear transformations in the form 


Xk 
r, = Ar,, where r, = : 


Yk 
4. The transpose of a matrix A is the matrix A’ whose columns are the rows of A. 
The main property of transposed matrices is 


(A,A,...A,)" = ATAT ,... ATAT. 


End of section exercises 
There are more end of section exercises than usual. This is to ensure that those of 
you who haven’t met matrix multiplication before get adequate practice. 


~ Exercise 15 
Given the matrices 


A= : 4 b Hi C = : A D ; ; 
—1 2 | 1 0 1 2 ) 3 ; 
form the following products where possible: 


(i) Ab (u) AC (11) AD (iv) bC (v) b7C | (vi) DC’. 


Exercise 16 


Show that the associative law A(BC) = (AB)C and the two versions of the distributive law, 
A(B + C) = AB + AC and (B + C)A = BA + CA, hold for the matrices 


2 0 1 1 2 —1 1 0: =1 
A= |-1 a i; B= |-1 2 +1, C= 0 ps 0}. 
3 4 -2 5 4 1 J 0 2 


(You have already found AB and BA in Exercise 7.) 
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Exercise 17 
Show that the distributive laws stated in the last exercise are true for the matrices 


ps 1 1 0 
7 ae 
A= 5 : ‘ B= |0 a5 C=|2 3 |. 
= 5. 4 4 -1 
Why is the associative law not applicable? : 


Exercise 18 
Express the simultaneous equations 


2x, + 3x, + 4x3 = -1 
Xx, —2x,+5x,= 0 
X.—2x,= 10 

in the form Ax = b. 


Exercise 19 
Combine the transformations 


bits bik At bales 
Yat bs HI, Ver ay 
into a single transformation, giving ee in terms of pe 


3 1 
(Solutions to Exercises 15-19 on pp. 57-58 | 


3. Change of axes 


3.1 Change of axes in two dimensions (Tape Subsection) 


This subsection is a tape subsection in preparation for the television programme. 
Start the tape now. 
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The new 
axes 


Given the co-ordinates (x,y) of a point P with respect 
Co some asxces, what are the co-ordinates Ce; uy‘) of 
this point with respect to some new axes ? 


& The strategy 


Split the problem up 


Put stages | and 2 together : 


‘ Rotation ee 
= 4) ee ay) y') 


Stage | Stage 2 
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What are the co-ordinates of 
P with respect to the 
O'uv axes ? 


Answer in Ghe 
| back of the text 
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oO : A Q 


= OA+ AR 


= OAT AS 


SS. SINK 


Se 


& The rotation matric 


7 
y 


=—wwW sind +t V Cos 


The equivalent matrix equation : 


are . the 
roration op Sos - _ 
matrix nen ao 
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> Exercise 2 


Cosa Sin - 


-Sine< Cose 


What are the Coordinates of P 
with respect to the Oxy axes ? 


& Different point of view 


pP(uv) 


a 4 (7) P' has coordinares (u,v), where 


U Cos &« -SIN &K 
Vv Sine 


(y) P has coordinates (u.w) where 
i Cos (-«<) - Sin (-c) 
v sin (-oL) Cos (-«) 


Cos & Sin & 
—Sin < 


(c) The new Co ordinates Co, of Pare 


U Cos Sin of 
Ww —Sing 
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The answer 
in terms of 
x and uy 


What are the coordinates of P 
relative to the O =x’ 7 axes” 


Answer in the 
back of tex€ 
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3.2 The re-orientation problem (Television Subsection) 
Read the following notes before viewing the television programme. 


In the television programme, we solve a three-dimensional problem which is the 
equivalent of the two-dimensional change of axes problem in the tape subsection. 
In Figure 1 below, for instance, we require to find the new coordinates (x’, y’,z’) of 
P with respect to an origin O’, given that P has coordinates (x, y,z) with respect to 
the origin O. 


y’ P (x,y,Z) 
(x’,y’,2’) 


Figure 1 x 


In the television programme, O is the planet Earth, P is the planet Saturn, and O’ 
is the space probe Voyager. The displacement vector OP of Saturn relative to Earth 
is referred to as s, and the displacement vector OO’ of Voyager relative to Earth is 
referred to as Vv. Thus from Unit 14 on vector algebra, we know that the 
displacement vector O’P of Saturn relative to Voyager will be s — v. These vectors 
are marked in Figure 2. 


: P (x, y=) 
oe ae a 


Figure 2 


At the beginning of the programme, you will see an explicit set of equations which 
determine the new coordinates, written without the use of matrices. You will see 
that these are very long and messy to write down, and have no obvious structure. 
The programme derives a matrix solution to the problem which is (a) more readily 
understandable, (b) easy to derive from first principles, and (c) allows us to see 
some structure in the resulting solution. 


When you watch the programme, you should concentrate on the strategy, rather 
than on the detail of the matrices. These are all given in the post-programme 
notes. 


Now watch the television programme: ‘Applying matrices—the algebra of re-_. 
orientation’. 


Read the following notes after watching the programme. (They summarize the 
programme, and could help to serve as a substitute for it if you have not been able to 
view it.) y 

The problem in the programme is the three-dimensional equivalent of the problem 
on the tape. (See Figure 1 above.) 


P (the planet Saturn) has a known set of coordinates with respect to a set of axes 
Oxyz, centred on Earth. Given another set of axes of known orientation whose 
origin is at a known point O’ (the space probe Voyager), what are the coordinates 
of P with respect to these other axes? 


As before, the problem can be broken down into stages. 


TV20 
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Stage | 


Itisa relatively easy matter to get the coordinates of P with respect to axes at O’ 
which are parallel to the original axes. We call these intermediate coordinates 


u, UV, W. 
P (x,y,z) 
: (u,v,w) 
(a,b,c) 
Figure 3 x 


These_can be found by finding the vector O’P in terms of the known vectors OP 
and OO’. Using the vector rule of addition, we know that 


—_—>' —->-—sMs => 
O'’P= OP — 00. 
In terms of column vectors, this gives us 
u x a 
vi=ly{—]bi. (1) 
w Z C 


Stage 2 = 
The rotation is not the simple problem that we had in two dimensions. 


y 


(x y',2’ 
The alignment of the O’uvw axes with the O’x’y’z’ axes (see Figure 4) can be done 
in various ways. The way we use is attributed to Euler. The strategy of this 
method is to break the rotation down into three simpler rotations, each of which 
holds one of the coordinate axes fixed. The first rotation we make does not 
actually align any of the axes, but positions the axes in such a way as to ensure 
that the two subsequent rotations will perform the entire re-alignment. The 
advantage of breaking the problem down in this way is that we know a formula 


for two-dimensional rotations. Figure 4 


t , FP (u,v,w) 
! 
| 


Imagine that the w-axis in Figure 5 is sticking up vertically into the air, and the uv-plane 
is the ground. If a rotation is made, keeping the w-axis fixed, the rotation will not 
change the w coordinate, as the distance of the point above the ground will not change. 
This, and the results from the tape subsection, give us 


u= ucosa+vsinag 
—usina + v cosa 


Ss 
I 


w= W. 


of / 


In matrix terms, we get the coordinates of P with respect to the axes O’u'v'w’ as 


/ 


COS & Sin a 0] fu 
v’ | = |—sinae cosa OQ} 04 (2) 
w’ 0 0 1JLw 


If we do a subsequent rotation, this time keeping the u’-axis fixed, and rotating 
through an angle £ in the v'w’-plane, then Figure 6 shows that the u’ coordinate 
will not change, and we get yet another set of coordinates for P, with respect to 


1 fora | piel sd 


another set of axes, O’u"v’ w”: 


/ 


u 


u’’ — 
v’= v’cosB+w’sinB 
wt es 


= —v’sinB + w’cosB. Figure 6 
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This can be expressed in matrix terms as 


a 1 0 0 uy 
vp’ | =]|0 cos B sin B | | vu’ |. (3) 
Ww" 0 —sinB cos B} | w’ 


If yet a third rotation is done, keeping w’’ fixed, and rotating about the w’’-axis 
through an angle y, then the w’”’ coordinate will not change. Similarly to the first 
rotation, we obtain the coordinates of P with respect to yet another set of axes 
oe aa ak ad * ° : 


* Cos sin y O}}u’ 
vy’ | = |-siny cos y OL ror. (4) 
w’' 0 '@) 1 w”’ 


Suitable choice of the angles a, 6 and y (sometimes called the Eulerian angles) will 
ensure that the O’u"’c""w” axes thus obtained are the required O’x'y’z’ axes, and 
thus that the coordinates (u’”’,v’”’, w’’’) are the required coordinates (x’, y’, z’). How 
these angles are chosen is shown in the programme, and illustrated in Figure 7 


below. (To avoid a confusing figure, the v’ and v’’ axes are not shown.) 


Figure 7 


The uv-plane and the x’y’-plane will intersect in a straight line. (Figure 7 shows 
two circular portions of these planes, centred at the origin, intersecting in the line 
marked O’u’.) The three rotations are carried out as follows. 

Step 1 : 

Rotate about the w-axis until the u-axis is in the line of intersection of the uv- and 
x'y’-planes. The « is the angle of this rotation. (The new u-axis is called the u’- 
axis.) 

Step 2 

Rotate about the u’-axis until the w’-axis coincides with the z’-axis. Then PB is the 
angle of this rotation. 

Step 3 : 

Rotate about the w”-axis until the u”- and v”-axes coincide with the x’- and y’- 
axes respectively. Then y is the angle of this rotation. 


We are now in a position to find an expression for the coordinates (x’, y’,z’) in 
terms of the coordinates (u,v, w). We know that 


/ tg 


y’ pas M yl" | (4) 
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u"’ u’ 

a = M, Gi (3) 
Ww” Ww’ 
u’ u 

alee: ° 
Ww’ Ww 


and M,, M, and M, are the rotation matrices on pp. 28-29. 


and 


Combining these transformations in the way we did in the last section, and using 
the associative rule, we obtain 


ee u’ 
A = MM, | from (3) 
2 w’ 
U 


= MMM, : 


W 


| from (2) 


u 
= mM (5) 


W 


Thus we can multiply M,, M, and M, together to obtain the required relation 
between x’, y’, z’ and u, v, w in the form 


where 
M = M.M.M,. 


= inally, we know that 


u 
zB 
. Lw 
x—a 
rd 
Z—C 
where (x, y,z) are the coordinates of P relative to the original origin O. So, 


substituting this into the matrix equation (5) above, we obtain the new 
coordinates for the point P in terms of the old coordinates: 


46" x-—a 
| = M,M,M, > | (6) 
2. zZ—C 


Post-television work 


(1) 


I 
‘SRE, 
ie ae. 
Be 
| 
crn 
oS. Se 
Marea. 


In the following exercise we ask you to do the final step yourself, thus returning 
full circle to the beginning of the television programme. 


Exercise 4 


Obtain three explicit equations, giving x’, y’ and z’ in terms of x, y, Z, the a 
coordinates a,b,c, and the Eulerian angles «, B, y. 


[Solution on p. 59] 


End of television programme notes 
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Summary of Section 3 


i. Given that the coordinates of P are (x,y) with respect to some axes Ox, Oy 
with their origin at O, then the coordinates of P with respect to another pair of 
axes O'x’, Oy’ with their origin at (a,b) and inclined at an anticlockwise angie OL 
to the meee axes (see Figure 8) are given by 


x =. COS a sina ||x—a 
y'| | sine  cosaj|/y—b| 


2. In three dimensions, if we wish to move the origin to the point (a,b,c) and re- 
orientate the axes (see Figure 9) the new coordinates of P are given by 


+ x-—a 
| = M,M,M, > o | 
z= Z—C 


where a, 8 and » are the Eulerian angles of the rotation and M,, M,, M, are 3 x 3 
rotation matrices each describing a rotation which holds one of the intermediate 
coordinate axes fixed. 


4 Square matrices and their inverses 


4.1 Square matrices 


In this section, we will restrict ourselves to the algebra of square matrices. The 
reason for doing this is that: 

(i) there are important results which are true only for square matrices, and 
(ii) real-life situations often produce square matrix blocks. 


Square matrices of the same size can be both added and multiplied, and thus 
satisfy all the laws of algebra discussed in the first part of this unit. Also, suitably 
sized zeros (0) and negative elements (— A) are already defined. But before we can 
use matrix algebra in a way similar to real number algebra, we still need to find a 
matrix analogous to the number 1, and another matrix analogous to the number 
a~'. This section discusses these matrices, and the resulting matrix algebra. 


(a) The unit (or identity) matrix 


This is a square matrix whose diagonal elements a,,, a,,...a,, are all ones, and 
all other elements are zeros. (The ones are said to lie on the main diagonal, which 
Starts in the top left-hand corner.) Examples of unit matrices are 


1 0 0 
i i and | 0 1 0 |. 
0 0 1 


Matrices such as these are referred to as I. The size of I is not normally specified, 
as this is usually clear from the context. 
Unit matrices have the important property that, for any square matrix A, 


AI = IA =A. 


The following example demonstrates this property for one particular matrix. 


Example 1 


2 1 
If A = 
A 7 " then 


and 


x 


Figure 9 
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(b) The inverse of a square matrix A 


A basic property of our real number system is that for each real number a (except 
for 0), an inverse a~' exists so that | 


a 'a=1. 


Without this property, it would be impossible to solve equations. This leads us to 
ask whether an inverse matrix A~! exists so that. 


AA =I. 


Example 2 


From the tape subsection, we know that if the x- and y-axes are rotated through 
an angle a, then the new coordinates of a point P will be given by 


a e COs a sh | (1) 
—sine  cosa|ly 


Now, starting with the co-ordinates (x’, y’), we know we can ‘undo’ the process 
above by rotating through a clockwise angle «. So we know that 


x cos (— a) sin (—a) |] x’ 
i = oe (—a) cos ea 
COS —sin« || x’ 

x ea m4 cos || 


Substituting this expression for * into (1), we get 


x cosa sina |[cosa  —sina ]f x’ 
y | |-sine cosa || sina cosa |] y’ | 
Thus the matrix product 
COS a sina || cosa —sin a 
—sina COS @ || sin a COS & 
/ 


leaves all vectors P fixed and must therefore be the identity matrix, I. So, if we 


, 


call the original rotation matrix M, then the matrix which ‘undoes’ the rotation 
must be M~. 


Exercise 1 
Given that 
Cos SI co —sin 
m-| os a 2a cS N=| ome S a 
—sin COS & sin COS a 
verify by multiplying the matrices together that MN = NM = 1. 
Exercise 2 


Given that 


<_ a b d B 1 d —b 
le d ei Gd —bel—e a 


where ad — bc £0, then verify that 
AB = BA = 1. 


[Solutions to Exercises 1 and 2 on p. 59] 


From this last exercise, it can be seen that if ad — bc is not zero, the 2 x 2 matrix 


roe 1 d —b : 
“ga — bel —c a (2) 


O 
Figure 1 
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such that A~'‘A = I. 


When the inverse of an n x n matrix exists, 


A A= AA * =I 


(3) 


In 2 x 2 matrices where ad — bc is zero, then no: inverse exists; but whenever a 
matrix A has an inverse A~’ then the commutative property (3) is true. This will 
be proved more generally in Subsection 4.4. 7 


Example 3 


2 1 1} 4 -1 Z 4 
U Equat 2), the 1 f iS = . 
sing Equation (2), the inverse o \ a 1S | = sf but 4 4 and 


0 | 
i i have no inverses, as, in both cases, ad — bc = 0. 


Exercise 3 
Using Equation (2), write down the inverses (if possible) of the following matrices: 


oe, Eee : 1 —1 oe Cees | 
(i) 5 | (11) E 5 - (111) 1 A 
[Solution on p. 59] 


One use of the inverse matrix is in the theoretical simplification of matrix 
expressions, and in the theoretical solution of matrix equations. 


If we are given a real number equation 
ax Sk 

then the theoretical solution of this equation can be written 
Xa 8. 


Similarly, if we are given a set of simultaneous equations, represented by the 
matrix equation 


Ax = b, (4) 
then we can also find a theoretical solution of this equation in the following way. 


Assuming that A~’ exists, we can multiply both sides of Equation (4) on the left 
by A? to give | 


A Ax =A S 
So 

Ix = A7'b as A~1A =I, 
1.e. 

x=A'b as Ix = x. 


It is in this kind of way that we use A‘ to obtain theoretical simplifications of 
matrix expressions, or theoretical solutions of matrix equations. In many problems, 
this is as far as we need go. However, the next subsection discusses a method for 
finding the numerical value of A~’, when this is necessary. 


4.2 A method for finding A~ : 


From Subsection 4.1, we know that the theoretical solution of the simultaneous 
equations 


Ax =b (5) 
1S | 
x=A ‘hb. 


So, a method for finding A~* would be to solve the equations (5), as the following 
example demonstrates. 
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Example 4 
ee | 0 2 | 
Find the inverse of the matrix 5 | by solving the equations 
Xy + Ox 2 = b, E 1 
2x, + 3x, =), E, 


Solution 
Using Gaussian elimination we get 


x, + Ox, =b, E, 
E, — 2E, Ox, + 3x, = b, — 2b, E. 
This gives the solution | 
E,,7+ 3 X_ = 3b, — 3b, 
x, =b,. 


In matrix terms, this means that the set of equations 


y ed-Lh 


which is of the form Ax = b, has the solution 


Mpeen ts 
X2 = 3 ILb2] 
which is of the form x = A~'b. 


| | 1 Ot. 
So, by solving the equations Ax = b we find that the inverse of 5 1S 


ae i ee 
3 = 


As you know from Unit 9, the elimination process can be done by operations on 
the rows of the matrix Alb. In fact, the whole operation of solving the equations, 
back substitution included, can be done by matrix row operations very similar to 
those used in Gaussian elimination, as the following example demonstrates. 


Example 5 
Once again, we shall find the inverse of a 2 x 2 matrix, namely 


i al 


This time, each step will be written down in matrix form. 


Working Matrix equivalent 

To solue To solve A x = 
X; + 2x, =), E, 1 ey | 
Xi + 4x, = b, E, 1 4 X2 


Note that b can be written as Ib. 


z= 


The elimination 
xX, + 2x, =), Ey. 1 2 |) x, 
E, — E, Ox, + 2X5 =n b, os b, E>, R, —_ R, 0 = X> 


The matrix equation (6) is just a matrix form of equations E, and E,,. However, 
with hindsight, we know we could have obtained both these matrices simply by 


subtracting R, from R,, which is the exact equivalent of subtracting E, from E, in 


the equations. 


(5) 
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To continue with the process of solving the equations: 
Working Matrix equivalent 


First, find x. 


x, + 2x, =5, E, 1 2x, | 1 Oj}, 5,] R, 
| ee Ox, + x, =+(b, —d,) En, R,+2 [0 I1ffx,| |-4 4b] Ra 


Again, the process of finding x, by dividing E,, by two could have been achieved 
by dividing R,, by two in both matrices. 


Now, because we are limited to row operations, we make a slight departure from 
the normal procedure of back substitution. Instead of substituting the value we’ve 
found for x, into E, to find x,, we subtract a suitable multiple of E,, from E, to 
eliminate x, from the first equation. This is in fact exactly the same as back 
substitution, but it is arranged in a different way. 


Working | Matrix equivalent 
Now eliminate x, from E,. 
E, — 2Ex x, = bh, ~.—)) R, — 2Ry P Fl -| 2 Sih Ri, 
X5= 4(b, a b,) En» 0 1 X5 —4 $ b, Ry, 
Once again, it can be seen that the process of subtracting twice E,, from E, could 
have been done directly by subtracting twice R,, from R, in the two matrices. In 


this example the process is now finished. (In a different example, we might still 
have had to divide E,, (or R,,) by a factor to obtain x,.) This gives us that the 


inverse of ; . is es 
ares 4) 


The point of this example is that the entire operation of finding the inverse of a 
matrix can be done by row operations simultaneously performed on A and I. 


Starting with the matrices 


A IL, 


perform the same linear row operations simultaneously on these 
matrices. If A is ‘reduced’ to I, the same row operations will 
change I to A7?. 


Thus, the example above could be done entirely in terms of matrix row operations 
as follows. | 


yee Soe 

2 Eg, 
} 4 0 N R, 
ae t- 6. 8, 
R, —R, & 2 | —1 ‘ R>, 
ee. 1 O|]R, 
ere eee ire 
R, —2R,,/1 0 2 -1]R,, 
é i | = 1 R», 

eS ee 


It is advisable if doing these problems by hand to check that the matrix you have 
found really is the inverse, by calculating A~ 1A or AA“?. 


This method for finding the inverse is quite systematic, and like Gaussian 
elimination, it is suitable for implementation on a computer. 
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Example 6 
Find A~', given that 


1 2 3 
A=| 2 5 8 |. 


1 1 7 
Solution 
Start with A | I 
| 2. 3 I 0 0] R, First get rid of the elements 
E 3 3 , R, ————a 
1 1 7 0 0 1) R, 
1 2 3 1 0 0] R, 
R, — 2R, ° 1 2 —2 1 ; R,, 
RR, 0-1) 41 1 8 1 
1 2 3 1 0 0] R, 
bs ae —2 1 } R,, 
R48, 00 0+. 4 <2 1 i? Es. 
; 2 3 1 0 0] R, Now do the equivalent of 
c 1 2 —32 1 ] R,, | back substitution. 
R,, + 6 0 0 § —5 4 1| R,, 
R, — 3R;, | 1 Zz 0 3 -+ -4)R,, 
R,, — oa 1 0 = : 4 R,, 
6 Oo 114 ¢ dR, 
R,, — 2R,, | 0 0 es +1 Rip 
0 I 0 oe 3 = R», 
0 0 I = 6 6 R3, 
Finish with I | i 


(The way we have labelled the rows above keeps track of how they change. Thus 
the first time a row operation is done to R,, for example, it becomes R,,; the 
second time it becomes R,,. Once you have acquired some practice at doing row 
operations like this, you will probably find it unnecessary to keep labelling the 
rows.) 


Exercise 4 
Use Gaussian elimination-type operations to find the inverses of 
CPS Sco (ii) | 1 3 0 
F | 0 1 3 
3 0 1 


[Solution on p. 59] 


4.3 The existence of A~! 


As you saw in Subsection 4.1, not all matrices have inverses. If we take a matrix 
which we know has no inverse, and attempt to use the method in Subsection 4.2, 
we shall find out how it breaks down. 


_ Example 7 
Suppose we attempt to find the inverse of 


aa 
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(for which ad — bc ¥ 0). Using the method described in Subsection 4.2, we get 
a I 
1 2 | 1 0; R, 
z 4 0 1|R, 
1 2 | 1 0; R, 
R, — 2R,| 0 0|-2 11 e.. 
At this point, we have to stop, as there is no multiple of R, [0 0] which we can 
subtract from R, to reduce A to I. In fact, if ever a row of zeros occurs in this 


way, we are not going to be able to complete the process to find the inverse. This 
means that the condition for A~' to exist is precisely the same condition that 


Ax =b 


should have a unique solution—namely that the rows of A must be linearly 
independent. 


You might get the feeling by now that everything seems to go wrong if the rows of 
a matrix are linearly dependent. This wouldn’t be far from the truth! Since nearly 
every proof in linear algebra assumes that the rows of A are not linearly 
dependent, some terminology is needed to indicate briefly whether this is so. 


A singular matrix is a square matrix whose rows are linearly 
dependent. Conversely (and not surprisingly), 


a non-singular matrix is a square matrix whose rows are linearly 
independent. | 


A matrix has an inverse if and only if it is non-singular. 


As you saw in Subsection 4.2, the problem of finding an inverse is effectively the 
same as that of solving a set of simultaneous equations. Thus, the discussion about 
rounding error and ill-conditioning in the unit on simultaneous equations can be 
applied to the problem of finding an inverse. Partial pivoting is used in an effort to 
minimize induced ill-conditioning and the subsequent build-up of error. However, 
we are not going to enter further into these questions in this course. 


4.4 Matrix algebra 


The laws we discussed in the earlier sections of this unit, and knowledge of specific 
matrices such as 0, I and A~', enable us now to do some matrix algebra (as 
compared with matrix arithmetic). So long as we only use known facts about 
matrices we may manipulate the matrix blocks without any regard to their actual 
contents (unless we need to, of course!) 


This subsection consists entirely of examples of matrix algebra, and some exercises 
for you to practise with. 


Example 8 
Multiplying out the expression (A + B)?, we get 
(A + B)* = (A+ B)(A +B) 
= A(A + B) + B(A + B) 
= A’ + AB+ BA + B?. 
It is important to note that this is as far as we can go, as in general AB does not 


equal BA, so AB + BA cannot be written as one term, as is the case for real 
numbers. However, there is no ambiguity in referring to AA as A”, etc. 


(using the distributive law) 


Exercise 5 
Expand the expression 


(A + B)(A — B). 
[Solution on p. 60] 


Partial pivoting is discussed in 
Section 3 of Unit 9. 
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Example 9 
We took as our definition of the inverse matrix that 
AUA=IL 


Now we want to show that if A~'A = I, then AA! = I, which was stated without 
proof in Subsection 4.1. The argument can be set out as follows. 


We assume that there is some matrix Q such that 
AQ =I. (7) 


We now want to show that Q = A™’. To do this, we multiply both sides of (7) on 
the left by A7?. 


A~'(AQ) = A“'FI, and so 


(A~'A)Q =A," 
by the associative law and the properties of I. That is, 
Q=A". 


So we have proved that for any square matrix A which has an inverse A~* such 
that A~'A =, it is also true that 


AA * =I. 

Example 10 

Show for non-singular matrices A and B, that 
(AB)! =B+A™*. 

This means that we have to show that 


(B-1A~')(AB) = I. 


Solution 
(B-'A~')(AB) = B~'(A~*A)B (using the associative law) 
= BIB (since A~*A = I) 
= BB 
=] (since B~'B = I). 


So the inverse of AB must be B™/A~?. 


Exercise 6 
Show for non-singular matrices A, B and C that 


(ABC)"' =C"'B-'A™?. 
[Solution on p. 60} 


It can be shown by induction that for non-singular matrices A,, A >,..., Ay, 


Exercise 7 (short!) 

(i) What is the inverse of A~'? 
(ii) What is the transpose of A’? 
[Solution on p. 60] 


The last example in this subsection concerns a type of matrix called a symmetric 
matrix. 


A symmetric matrix A is defined to be one for which 
A’ =A. 
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An example of a symmetric matrix is 


si} 


and, as you can see, it is symmetric about its main diagonal. 


Example 11 


Show that any matrix Q has the property that QQ’ is symmetric. (Note that the 
product QQ’ can always be formed, as the number of columns of Q is 
automatically equal to the number of rows of Q’.) 


Solution 
We need to show that (QQ’)’ = QQ’. Now, using the fact that (AB)’ = B™A™ 
(see Subsection 2.5), we get 

(QQ)? = (Q7)"Q? 

= QQ’, since (Q”)” = Q from Exercise 7(ii). 

So the result is proved. 
Exercise 8 
Given that A? = I, show that 

(I+ A)(I— A) =0. 
Warning: A? =I does not necessarily mean that A = +I. You can check for instance that 

1 

F a has this property. 


Exercise 9 
Show that for any non-singular square matrix A, 


fe aie a (A>?) 
Exercise 10 
(1) Show that if A is any square matrix, then (A + I) and (A — I) commute, i.e. 
(A+I1)(A—I)=(A-—D(A+)D). 


(ii) | Use this to show that if (A + I) is a non-singular matrix, then (A + I)~' and (A — I) 
commute, i.e. 


(A+T1D (A-D=(A-D(AGD“. 
[Solutions to Exercises 8-10 on p. 60] 


Summary of Section 4 
1. Definitions 


(i) The unit (or identity) matrix I is a square matrix (whose size depends on the 
context), with ones down the main diagonal and zeros everywhere else. For 
any square matrix A, 


AI = IA =A. 


(i) A singular matrix is a square matrix whose rows are linearly dependent. A 
non-singular matrix is a square matrix whose rows are linearly independent. 


(iii) The inverse matrix A~‘ of a non-singular matrix A has the property that 
A-'A=AA't=L1 
Singular matrices (and non-square matrices) have no inverses. 
(iv) A symmetric matrix A is one for which 
A’ =A. 


2. Computation of A7! 


The problem of finding A~* is effectively the same as solving the set of equations 
represented by 


Ax = b. 
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We use Gaussian elimination type operations, with partial pivoting. Suring with 
the matrix pair 


All, 


perform the same linear row operations simultaneously on both these matrices. If 
A is reduced to I, then the same row operations will change I to A~?. 


3. A property of inverses 
GA, AD =A. A A 


End of section exercises 


Exercise 11 


Use Gaussian elimination-type operations to find the inverses (where possible) of the 
following matrices: 


(i) E 7 (ii) } ce Gi 44 oe 
e127] See ee ca 
ee 


Exercise 2 

If the matrices A, B and X are non-singular and 
B=xX ‘AX 

find A in terms of B and X. 


Exercise 13 : 

Show that if A and B are square matrices such that A and B commute, then: 
(i) A? and B’ commute, 

(ii) A‘ and B~! commute, 

(ii) A and B~! commute. 

[Solutions to Exercises 11-13 on pp. 60-61] 


5 Introduction to determinants 


In the next few units, we shall be considering some applications of matrices—to 
vibrations of mechanical systems, for example. In addition to the matrix algebra 
considered so far in this unit, some of these applications involve a number 
associated with a square matrix, called its determinant. 


5.1 Definition of 2 x 2 and 3 x 3 determinants 


(a) 2 x 2 determinants 
We have seen in Subsection 4.1 that the 2 x 2 matrix 


ee 


| has ‘an inverse if and only if the number ad — bc is not equal to zero. This number 
ad — bc is called the determinant of the matrix A, often denoted by det A. ieee 
of det A, we sometimes write 


a 


where the straight lines indicate that we are referring to the number ad — bc, not 
the matrix. 


Example 1 
1 2 
{fA = 
If A } st then 
1 2 
det A = 
AG 


=(ix4)—@x 3)= -2 
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Exercise 1 
Evaluate det A if: 


. s-| 3 : : at : 
oe Py at 


[Solution on p. 61] 


(b) The 3 x 3 case 
A definition of the determinant of a 3 x 3 matrix can be given as follows. If 


Ai Qi2 a13 
A= |a), ar Ar3 |> 
431 432 433 
then 
Qi Qi2 a3 
detA = |a,, i; A>; 
a31 a32 433 
Az2 a23 Ar; az3 ar, Az2 
= Oy, — Q12 + 43 (1) 
a32 a33 a3, 433 a3, a32 


= G1 1972433 — Ay 1473032 — 412071433 + A, 207303, 
HF 43421432 — 41347243). 
This way of evaluating det A is often referred to as expanding by the top row. This 
is simply because we write down the first element in the top row (a,,) and 


multiply this by the determinant of the elements left when we cross out the row 
and the column in which a,, lies. 


arr ——-49--- 3 


4 a2 a3 


31 a32 a33 


We then move to the next element in the top row (a,,), and multiply this by the 
determinant of the remaining elements when we cross out the row and column in 
which a,, lies. 


a 
az; Qo. a3 
a31 32 a33 


This is the determinant 


We continue along the top row in this fashion until we reach the last element. As 
you can see from the definition, the signs connecting these terms alternate. A 
general n x n determinant may be defined in a similar way, but for the purposes of 
this section we will confine the discussion mostly to 2 x 2 and 3 x 3 determinants. 


Example 2 


Z 1 3 
If A =0 2 oe 
3 1 6 


42 MST204 20.5 


then using Equation (1), 


2 1 3 

det A = |0 2 1 
3 1 6 

2 1 0 1 0 Zz 
=P gl be ft Pxb ol 
= (2 x 11) —(-—3) + 3 x —6) 
=22+3-—-18 
et 2 
Exercise 2 


1 2 0 
4 1 5 


evaluate (i) detA, (ii) det A’. 


Exercise 3 
1 2 3 
Find det A if A = ‘ 5 : 
7 8 9 


[Solutions to Exercises 2 and 3 on p.61 | 


You will see some of the situations in which determinants arise in the next 
subsection, and you will need to be able to evaluate determinants in the unit on 
eigenvalues. 


5.2 Situations in which determinants arise 

Surprisingly, determinants can be used to simplify the formulas occurring in 
_ various mathematical situations. 

(a) Vector products 


(i) The cross product 


In Unit 14 on vector algebra, you saw that the cross product a x b was given by 
the lengthy expression 


a x b = (a,b, — a,b,)i + (a,b, — a,b,)j + (a,b, — a,b,)k. 
We can re-express the middle term, to get 

a x b = (a,b, — a3b,)i — (a,b; — a3b,)) + (a,b, — a,b, )k 
a3 ek Cee fa a3 ay a2 
b, b, b, b, b, b, 


This expression can be remembered even more conveniently as a 3 x 3 
determinant: 


s+ k. (2) 


i j k 
ax b= la, a, a;|. (3) 
by b, bs 


Note that we take the liberty of putting i, j and k in the top row, even though they 
aren’t numbers. This is because the resulting form is so easy to memorize. 


Exercise 4 
Use the determinant form of the cross product to evaluate a x b, where 


a= 3i—j + 4k and b=i-—j+k. 
[Solution on p. 61] 
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(ii) The triple scalar product 
The triple scalar product a: (b x c) which you met in Unit 14 can also be most 
easily remembered as a 3 x 3 determinant. 


Expressing b x c in the style of (2) above: 


b, b, ae b, b, 
C2 C3 Ci C3 


Oy By 


Cy C2 


b xe = j+ 


and the dot product of a,i + a,j + a,k with this is then 


a:(b x c) =a, 2 aa "3 + a, ee 
gc C3 a C3 Cy C> 
a, a, as 
— eee ee ee (4) 
Cy C2 C3 


Again, this is far the simplest way of writing the triple scalar product. 


(b) Areas and volumes 0) B(b;,b2,0) 
(i) The area of a triangle in two dimensions 
The area of a triangle in two dimensions can be expressed as a 3 x 3 determinant. 
From Unit 14 you know that 

">. —> —> ——>_—” 

AB x AC = (|AB||AC|sin 0)k 


O 
where the triangle ABC is as in Figure 1, 6 is the angle between AB and AC (both Figure 1 
in the xy-plane), and k is the Cartesian unit vector perpendicular to the xy-plane. 
We can also express the area of the triangle ABC as 
Area of triangle ABC = 148 IACI sin 0 
re 8 
= 5|AB x AC. eee 


Now, 
> - > 


AB x AC = (b— a) x (c— a) 
= (b x c) — (a x c) — (b x a) + (a x a) 
= (b x c) — (a x c) + (a x db) 
(by the properties of cross products) 
i j k : j k i j k 
=|b, b, ote, a, O}+/a, a, 0 
cc c 0 c; oe 0 b, b, 0 
= ((b,c, — cb) — (aye, — cya,) + (a,b, — b,a,))k. 


The coefficient of k in this determinant can be written as the determinant 


1 1 1 
= 1a, b, Cy}> 
a, b, Co 


and so from Equation (5), 


1 1 1 
Area of triangle ABC = modulus of 5]a, 6, = ¢;,}. (6) 
a, b, C3 


(We have had to write ‘modulus of’ because of the unfortunate similarity between 
the modulus sign and the above way of writing determinants.) 


Exercise 5 
Show that the area of the triangle OAB in Figure 1 is 


4/41 b, 
2 


a, b, 


[Solution on p. 61] 


4 MST204 20.5 


(ii) Volumes in three dimensions‘ 
The volume of the squashed box in Figure 2 (normally called a parallelepiped) can 
z= ae —— > ae 
be expressed as a determinant. Suppose the position vectors OA, OB and OC are 
a= a,i+ a,j + ak, 
b = 5,1 + b,j + bk, 
c=Cc,i+c,j + c3k 


respectively, and n is a unit vector perpendicular to the base OBDC of the 
parallelepiped. 


We know that the height h of the parallelepiped can be expressed as 


h = |al|cos d Figure 2 
= |a-nl, (7) 


where a, n and ¢ are as shown in Figure 2. Also, if 0 is the angle between the vectors b 
and ¢ in Figure 2, we know from Unit 14 that 


b x ¢ = (|b||c| sin 6)n, (8) 


which is the area of the base of the parallelepiped, times a unit vector n. Now, the 
volume V of our parallelepiped is 


V = height x base 
= (|a|cos @) x (|b||c| sin 6) 


= |a-n| x (|bi|c| sin 0) from (7) 
= |a- (|b||c| sin 6n)| 
= |a-(b x c)| from (8). 
Thus, using Equation (4), we derive a determinant expression for the volume: 
ay, a, a3 
V = modulus of |b, b, b,|. (9) 
Cy 9) C3 
Exercise 6 
(i) Find the volume of the parallelepiped generated (as in Figure 2) by the position 
vectors 
a=i+k, b =i + 2j and c =j + 3k. 


(ii) | Show for this particular case that 
a-(b x c)=c-(a x b) = b-(c x a). 


[Solution on p. 61] 


You can see from Exercise 6 that cyclic permutations of a, b and c gave the same 
numerical result. This is fairly obvious if we think about the way we generated the 
volume of the parallelepiped. The base was quite arbitrary. We could have just as 
easily chosen OCEA as the base, and its area would be the magnitude of c x a. So 
long as we are careful not to alter the right-handed order of the vectors, any of the 
expressions in Exercise 6 (ii) will generate the volume of the figure. 


(c) Solutions of simultaneous equations 
You can check that the equations 
A,1X; + ay2xX, =), 
A,X, + a,x, =b, 
have the solution 


_ _Gz2b, — a,b, oa 1b, — a,,b, 


= , 2 
we gs es nd ee 421412 — 411422 
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provided that the denominators aren’t zero. Writing the equations in matrix form 
Ax = b, the solution can be expressed in determinant form 

A A 
where A is det A, A, is the determinant of the matrix formed by replacing the first 
column of A by b, and A, is the determinant formed in a similar way by replacing 
the second column of A by b. 


There is a similar formula for the solution of any system Ax = b of n equations in 
n unknowns, known as Cramer’s rule: 


Sg ge ey =— (10) 


provided that A # 0. Here, A is det A, and A, is the determinant of the matrix 
formed by replacing the kth column of A by b. | 


As you will see in Subsection 5.4, it turns out in practice that Gaussian elimination 
is a much more efficient way of solving simultaneous equations. However, 
Cramer’s rule provides an interesting theoretical view. For example, expressing the 
solutions in the form of Equation (10), we can rephrase in terms of determinants 
the conditions which we found in Unit 9 for a unique solution, no solution and an 
infinite number of solutions. 


1. If A #0, the equations have a unique solution. 


2. If A =0O and one or more of the A,s is non-zero, then the equations are 
inconsistent, and have no solution. 


3. If A =0 and also A, = A, = --- =A, = 0, then the equations are linearly 
dependent, and will usually have an infinite number of solutions. 


5.3. Properties of determinants 


Determinants have some interesting and useful properties. In this subsection we 
explore some of these for 3 x 3 determinants, using Equation (4) which tells us 
that 


a, ay as 
R=) 6, 5,5, =a: x ¢) 
C, C5 C3 


and using some of the properties of vectors which you have already met in Unit 
14, and Subsection 5.2 of this unit. In particular, we shall be using the cyclic 
property which you met in Exercise 6: 


a:(b x c)=b-(c x a) =c-(a x D). ) (11) 


Property 1: interchanging rows multiplies the determinant by —1. 


To examine the effect of interchanging the second and third row, we look at 
a:(c x b). Now we know that 


cx b= —-bxe, 
SO 
a-(c x b) = —a-(b x Cc) 
= —A. (12) 
If we change the first two rows, we get 
b-(a x c) =—b:(c x a) 
= —A (from (11)). 


The following exercise completes the possibilities. 
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Exercise 7 
Show that c:(b x a) = —A. 


[Solution on p. 62] 
So, interchanging any two rows of a determinant changes the sign of the 


determinant. We shall refer briefly to this property as P1, and to the following 
properties as P2, P3, etc. 


Property 2: if two rows of a determinant A are identical, then A = 0. 
If row 2 and row 3 are identical, then 
A =a:(b x b) 
= 0 (a property of the cross product). 
If row 1 and row 2 are identical, then 
A=a:(a xc) 
=c'(a x a) _ (from (11) 
= (), | 


Exercise 8 
Show that if row 1 and row 3 are identical, then A = 0. 
[Solution on p. 62] 


Property 3: if a row of a determinant A is multiplied by k, then the value of this 
determinant is kA. 


We know that multiplying either of the vectors in a dot or cross product by k 
multiplies the result by k. Thus, 


(ka): (b x c) = k(a: (b x c)) 
= kA. 
Also, a ((kb) x c) and a-(b x (ke)) are each equal to a: (k(b x c)), which is equal 
to kA. | 


This result isn’t very important in its own right, but it helps to prove the next 
property, which is very useful. 


Property 4: if one row of A is a linear combination of the others, then A = 0. 
Suppose row | is a linear combination k,b + k,¢ of row 2 and row 3. Then 
A = (k,b + k,c)-(b xc), 
= (k,b)-(b x c) + (kc) (b x ¢) 
= k,(b-(b x c)) + k,(e-(b x €)) 
= 0 (from P2). 
Similarly,.if row 2 is a linear combination k,a + k,¢ of row 1 and row 3, then 
A =a: ((k,a + k,¢) x ¢c) 
=a-((k,a) x c) + a-:((k,c) x c) 
=k,(a:(a x c)) + k,(a-(c x c)) (from P3) 
= (from P2). 


Exercise 9 
Show that if row 3 is a linear combination of rows 1 and 2, then A = 0. 
[Solution on p. 62 | 


The consequences of P4 are wide and will be discussed in Subsection 5.4. 


Property 5: if a row of A is zero, then A = 0. 
This is just a statement of P4, if one row were zero times another one. 
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Property 6: if a linear combination of a set of rows of a determinant is added to 
another row not in the set, then the value of the determinant remains unchanged. 


Suppose we add k, times row 1 plus k, times row 2 to row 3, for instance. Then 
a:(b x (¢ + k,a + k,b)) = a-(b x c) + a: (b x (k,a + k,b)) 
=A+0 (from P4)., 
The other possibilities follow in the same way. 
The last property we discuss in any detail in this section will be proved 


algebraically without using vectors. 


Property 7: det A = det A’. 


Proof 
a; a, as 
b, b.| b b b b 
b b b a Ce a a 1 K +a 1 2 
1 2 3 1 2 3 
‘ P Z C> C3 C, C3 C1 Cy 
1 2 3 


= a,(b,c3 — b3c,) — a,(b\c, — b3c,) + a3(b,c. — b,c,) 
= a;(b,c3 — ¢b3) — by (a,c, — c,a3) + c,(a,b3; — b,a5) 


bz C2 | ay C2 a2 b, 
= a, ane b, + Ci 
b; C3 a3 C3 a3 bs 
a, b, Ci 
=e ae T 
a3 b, C3 


Property 7 has an important consequence. ea 1 to 6 can be applied to 

any determinant. In particular, these properties can be used to describe the effect 
which ROW operations on A’ have on the value of det A’ (and hence the value of 
det A, using Property 7). Now row operations on A’ are column operations on A. 
The consequence of this is that in Properties 1 to 6, the word ‘row’ could be 
replaced by ‘column’, and the statements would still be true. For example, 
rewriting Property 5: if a column of A is zero, then A = 0. 


The following example demonstrates how to use these properties to simplify the 
evaluation of determinants. 


Example 3 
Evaluate det A, where 
1 2 3 
Ae s4 5 6}. 
7 8 9 


(i) Using P6 twice, we can subtract 4 times row 1 from row 2 and then 7 times 
row | from row 3, to obtain 


1 = 3 
det A =| 0 —3 —6 
0 —6 —-—12 
Again using P6, we can subtract twice the new row 2 from the new row 3 to 
obtain 
1 2 3 | 
0 —3 —6| =0 (from P5). 
0 0 0 


(ii) Another way of evaluating this determinant would have used column 
operations. We can use P6 twice, to subtract twice column 1 from column 2 
and then 3 times column 1 from column 3, to obtain 


1 0 0 
ae 
a ee 


det A = 
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| Now, subtracting twice the new column 2 from the new column 3, we obtain 


| 1 0 0 
det A = |4 —3 0; =0 (from P5). 
7 7 —6 0 


There are plenty of other ways of showing that this determinant is zero, but the 
Gaussian elimination technique, which P6 allows us to do, is as good as any. 


A look back to P4 tells us that if the rows of A are linearly dependent then the 
value of the determinant will be zero. The four statements about a Square matrix: 
(i) the rows of A are linearly dependent, | 

(i) A is a singular matrix, 

(11) detA = 0, 

(iv) A! does not exist, 

all say the same thing in different ways. Equally well, the statements: 

(i) the rows of A are linearly independent, 

(i) A is a non-singular matrix, © 

(iii) detA £0, 

(iv) A~? exists, 

all say the same thing. We shall find in the next unit how the determinant form of 
this statement is useful. 


There is one other useful property which, for completeness, is stated in this section, 
although not proved. 


Property 8: given two square matrices A and B of the same size, then 
det (AB) = det A det B. 


Finally, although the proofs in this section only show that the properties are true 
for 3 x 3 determinants, they are in fact true for determinants of any size. 


5.4 A general method for the evaluation of determinants 


Any n x n determinant may be evaluated by an extension of the way we evaluated 
a 3 x 3 determinant in Subsection 5.1. However, in general it is much simpler to 
use a procedure analogous to Gaussian elimination. The idea of the method is to 
reduce the matrix to upper triangular form, since the determinant of an upper 
triangular matrix is very easy to evaluate. This is illustrated in the following 
example. 


Example 4 


To evaluate the determinant of an upper triangular matrix A, where 


a b Cc 
A = f d | 
0 ‘eae & 


P7 tells us (among other things) that expanding by columns is just as valid as 
expanding by rows, so we expand by the first column to obtain 


d e b Cc b Cc 
ata = d). i 6 raza ‘ 
= adf. 


It can be shown that this principle generalizes to the case of an n x n matrix in 
upper triangular form. We thus have the following result. 


If any square matrix is in upper triangular form (i.e. all elements 


below the main diagonal are zeros), then its determinant is just 
the product of its diagonal elements. 
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Now, we know frem P6 that the row operations performed in Gaussian 
elimination without pivoting do not affect the value of the determinant. 
(Remember, the only operation allowed us in Gaussian elimination without 
pivoting is to add or subtract < multiple of one row to or from another.) So, to 
find det A, use the operations of Gaussian elimination to reduce A to a matrix U 
in upper triangular form, 


Then 
det A = detU = u,,u,,...u,,. 
Example 5 
Use this technique to evaluate det A, where 
1 1 1 
A= 11 4 ao. 
: 3 8 


First we do the Gaussian elimination steps 
R,, == R, = R, and R;, sees R, con 2R,. 


We know from P6 that this will not affect the value of the determinant, so 


1 1 1] R, 
detA=|0 3 2] R,, (Stage 1(a)) 
0 1 6! R;, 
Subtracting 3R,, from R;, gives 
1 1 1 
det A =| 0 3 2 (Stage 1(b)). 
0 OS 


At this stage, we can write down the value of det A: 


gi A= 1x3 x > 
= 46. 


For large determinants the amount of work involved in evaluating det A using 
Gaussian elimination is far less than without it. 


Exercise 10 
If 


find det A (i) directly, (ii) using Gaussian elimination. 


Exercise 11 


Use Gaussian elimination to evaluate 


Dee SD 
NO W bd 
_ DH WwW 


[Solutions to Exercises 10 and 11 on p. 62] 
Finally, let us take a brief look at Cramer’s method for solving n simultaneous 
equations in n unknowns, 
A, A; A, 
x = center Xo = ee eee y a 
ee ee ee A 


where the A,s are the determinants of the form given in Subsection 5.2. To solve a 
set of equations in this way would involve the equivalent of n + 1 lots of Gaussian 
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elimination, whereas solving the equations directly only involves using Gaussian 
elimination once. So, although Cramer’s rule is a neat theoretical solution, in 
practice we would hurriedly discard it in favour of Gaussian elimination. 


Summary of Section 5 


A determinant, denoted by det A, is a number associated with a given square 
matrix A. If A is a 2 x 2 matrix, then 


a a 
det A = 11 iz 


= 4,122 — A424}. 


az4 a22 


If A is a 3 x 3 matrix, then 


Qi; 412 443 
detA = 142; 422 a3 


Q3,; 432 433 


a22 a23 a1 a3 ar1 ar2 


= aj = 


a32 a33 a31 a33 a3) a32 


Larger determinants have similar definitions. 

The most important properties of determinants are: 

(i) if the rows (or columns) of A are linearly dependent, then det A = 0, 

(ii) linear combinations of rows (or columns) of a determinant can be added to 
any other row (or column) without changing its value. 


Using this last property, an n x n determinant det A can be evaluated by reducing 
A to an upper triangular matrix U, using the operations of Gaussian elimination. 
Then 


det A = det U = U4 1U22..-Unn- 


End of section exercise 


Exercise 12 
Find det A if: 


: 4 
) a=| | ‘i 


1 2 -1 
fi) A= j-1 1 2 
2 -1 1 
1 1 Zz 3 
a 1 Z 4 -1 
(1) A= 5 3 |: 
0 1 2 -1 
[Solution on p. 62 | 
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6 End of unit test 


As a lot of the terminology in this unit may be new to you, you will find a 
glossary of new terms used in Sections 1-5, and some important results, 
immediately following the test. 


The test is divided into two sections. Section A should not take you longer than 
10 minutes. If you get any of this section wrong, and don’t understand why, go 
back and check the appropriate section in the unit before you go on to Section B. 


Section A 


There is only one correct option to each of the following questions. 


1. Which of the following is a 3 x 2 matrix? 


1 1 3 0.2 | “ 3 1 x 
(a) | 1 0 3 (b) | 03 Lt 6) 5 ' 4 a io Uae Be 

3 0 -1 0.4 1.4 Z 
(eo ge 7] 


2. Given that 


8 5 a 
2 Se ee 
3 4 rE | 


the value of s is 


(a) 24) - 4) See 2 Gt (d) 1, fe) 8x 4 2p 4 3 


5. 1A = ; 3 which of the following can be added to A? 

faj 34 4 (Db) 42 3 Oy Be beak meee (dj) 15 3 5 (e) | 1 
2 5 2 1 3 1 5 > 
3 6 


2 1 —1 ba: 

4. 3 4 + 5 J is equal to 

(a) |0 Z i ct 2 (c) [6 6] (d) |3 0 (e) |; 
b |; P |; a 9 


8 12 4 
5. 2) 4 30 2 | is equal to 
Zz 10 0 


(a) /16 24 8 i146 4A 8 ie) 186: = 1 oe 
4 
0 


8 60 4 30 2 8 30 2 
4 20 ny 10 0 4 10 0 
(d) | 4 6 2 (e) | 4 6 2 
2 15 1 4 30 Fs 
Ss 2 10 0 
V1 
6. Ifx=[x,; x2 x3] and y =| y, |, then xy equals 
Y3 


7 x4 yi XiV1 
(a) [x11 + X2y2 + X3y3] (b) | x2 y2 (c) | X2y2 


X3 3 X3Y3 


Xj X2 X3 
(d) [xiy, X22 X3y3 | (e) 
Vi ¥2 y3 
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12 13 “ 
7. Which one of the following can be multiplied on the left by 4 56 5 Mi 
(a) | 2 1 5 (b) [4 1 2] (c) | 4 3 6 8 
2 5 4 1 1 4 2 


3 P: > 5 


a2 1 3413 1 1 | is equal to 
3 0 4}}1 5 


0 
4 -2 0 
(a) | 19 4 (b) 6 5 4 (c) | 19 1 2 
2 1 i =F 4 6503 
25 —- 3 
(d) | 19 25 (e) | 6 1 3 
1 —5 3 0 0 
2 3 8 —2 0 
9. A matrix equivalent of the equations 
3X; + 2X + X3 >= 4 
x5 a5 
Mi ek = x 
iS 
la 43 2 Vix, 4 (b)> [3 X> x3]13 2 1 4 
0 0 ee se oe | 0 0 Ce 
1 1 214s 1 1 1 2 1 
Oy- tee pes 1 4 (d) | 3 2 1} = [4 1 J it x; 
X21} 0 i: —{[4 =10 0 0 1 X> 
x3|] 1 2 -1 0 1 1 2 ¥ 
(e) | 3 2 ieee 14 
0 1 —] X2 = 0) 
1 2 —1 X3 0) 
10. If Ax = b, and A is non-singular, then 
(a) x=bA' (ob) bx =A7 (c) x=A™'b (d) x=IA (¢) x= AbA 


[Solutions to Section A on p. 62] 


Section B 
a eee. 
1. IfX=11 0 0 |, evaluate 
| 0 Z 3 
i) A+1 @) XP SR Ge) Kk? a x? 
2. If A is a square matrix such that A2 = 0, show that 


A(+ A)? =A. 


3. If A,B and C aren x n matrices: 
(i) simplify (A + B) — (A + C), 


(i) show that (A + B)? — A(A + 3B) — B(A — B) = 2(B — A)B. 
4. 
(i) Find the inverse of the matrix 
1 0 0 
L=|x 1 OF. 
y Zz 1 
(u) Given that 
1 3 1 2 3 1 3 
A.= 10 3 zt B= 1 4 7) and« = 74), 
1 0 z 2 1 a7 5 
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find the column vector x with the property that 
Bx = Ax +c. 


(i) If A is a square matrix such that 


A*-A+I=0, 
show that 
A-'=I-A. 


(11) Show that 
(R~‘AR)? = R7'A5R 


where R is any non-singular matrix of the same size as A. Generalize this result. 


6. A square matrix A is said to be orthogonal if AA’ = I. 


=} ae 


oA : is orthogonal. 


(i) Show that A = 
2 se 


nd 


(ii) Write down the inverse of this matrix. 


(ii) Show that if A and B are orthogonal matrices of the same size, then AB is also 
orthogonal. 


xX} 
7. Suppose x =| : | is ann x 1 column vector whose elements are the n variables 
Xn 
X4,...,X,, and A is ann xX n matrix. 
Then the matrix 
Q = x’ Ax, 
regarded as a function of the variables x,,...,x,, is called a quadratic form. 


(i) | What is the size of Q? 
(ii) Find Q when 


b 
= is ‘Me b | 
x C d 
(iii) Find a matrix B such that x’ Bx is the quadratic form 
x; 265s 


(Note: there are several possible solutions to this part.) 


8. (More difficult) 
x} yi 

Suppose x = | : and y=j| : | are column vectors (m x 1 and n x 1 respectively), 
Xm Yn 

whose elements are variables, as in Question 7, and that A is an n x m matrix. Show that: 

(i) y"Ax = (y7Ax)’, 

(ii) (y — Ax)"(y — Ax) = y’y — 2y7Ax + x7ATAx., 

9. Evaluate the determinants of the following matrices. 


(i) & fei ta S3 (iii) pag 


2 8 ef 1 0 0 -1 
eee 3 0 
3 =2 3 


[Solutions to Section B on pp. 62-64] 
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Glossary of new terms used in this unit 


Pa ahi al cat ill Saal BU 


10. 
11. 
12. 
13. 
14. 
15. 


16. 
17. 


Matrix: a rectangular block of numbers. 

Matrices: plural of matrix. 

Column vector: a matrix with only one column. 

An m X n matrix: a matrix with m rows and n columns. 

Element: an individual entry in a matrix. 

a;;: the element in the ith row and jth column of the matrix A. 

Linear transformation: a relationship between two column vectors x, and x, 
that can be written in the form x, = Ax. 

0, a zero matrix: a matrix each of whose elements is zero. 

I, a unit matrix: a square matrix with ones down the main diagonal, and 
zeros everywhere else. 

A~'), the inverse of A: a matrix with the property A-'A = AA7* =I. 

A’, the transpose of A: a matrix whose columns are the rows of A. 

A singular matrix: a square matrix whose rows are linearly dependent. 

A non-singular matrix: a square matrix whose rows are linearly independent. 
A symmetric matrix: a matrix A with the property A’ = A. 

Upper triangular form: refers to a square matrix with zeros below the main 
diagonal. 

An orthogonal matrix: a square matrix A with the property that AA’ =I. 

A quadratic form: a single element matrix of the form x7 Ax. 


Some matrix results 


oe ee ee 


(AB) = BA’. 

(7. oy. Ce. = ey eee RA. 
(AB)! =B fA? 

(A; AS A Se A ee ee 


MST204 20 Solutions 


Appendix 


Solutions to the exercises in Section 1 


1. (i) Any matrix with two rows and two columns will 


1 2 
do—for example, F | 
1 Zn 3 4 
(ii) E 6 7 8 |is a typical example. 
: a | > 3 
1 
ae 2: os; 
(iii) 31° This is an example of a column vector (or a 
4 


column matrix). 


(iv) [1 


matrix). 


4]. This is an example of a row vector (or a row 


2. This element would be referred to as Pas. (The row is 
always specified before the column.) 


3. By the definition of equality we get 
ji) r=4,u=9,z= -3,w=4. 


(ii) ax+by=c and dx + ey= f. 
4. (i) A+B 
2 1 3 —1 + 0 
=|4 3 fio? 4 1 0 
1 5 6 0 1 1 
2-1 1+4 3+0 
={4+1 3+1 2+0 
LU 5+1 6+ 1 
1 5 3 
ee 4 2 
1 6 7 


(ii) Not possible—as A and C have different sizes. 
(iii) Not possible—as C and x have different sizes. 


) w+y+e=({|+|_j]]+ l= Li} A 


implies the equations 


2+1l=a and 4—3=b. 


6. (i) C must be the same size as A and B, so C will be an 
m X n matrix. 

(ii) cj;, the element in the ith row and jth column of C, is 
formed by adding the equivalent elements in A and B. So 


Cij a aij + b;;. 


of 


0 0 
In this case the zero matrix will be 2 x 2—i.e. | 0 | 
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8. 
co 3 
oS ae ee 1 
4 3 
7-6 ae oe 
iad =|2 1]. 
, oe eee 3 4 
eee 10 5 
‘o@ tA = 513-2 = 115-8 
a 4 20 5 


10 5 =10--<6 
(i) k(A+mB)=5A4+10B=/15 1014+] 10 0 


10: 235 — 
Gil) kA=mB=| 15= 40)=+4 2 
ee a 


12 5 
=| 13 10 
18 < 


(iv) mA +mB+kA —kB=(m+k)A + (m—k)B 


J 4 =i <6 
a7 Fie 1 


17 7 
= | 18 14 
ys 10 


10. (i) 2(A+3B)—4(A—B)= 2A + 6B—4A 4+ 4B 
= 10B — 2A. 
Gi) X+3(A+X)=2X—A 
4% 3A = 1K A 
2X = —4A 
= 236 


ea Fg oat ek 
ll. (i) @) A+B=[° agit l/s A 
6 

b) B-C=|° 


(c) Not possible. 


oxo (Jelt]-[et4 


(ii) 2(P + 2A) +B=3C: 
2P+4A+B=3C: 
P =43C — 4A —B) 


“He en EO 


Solutions to the exercises in Section 2 


1. Using Tables 1 and 2: 
(i) | John’s requirements in shop B cost 


(6 x 17) + (3 x 32) + (2 x 33) = 264p. 


(ii) | Jane’s requirements in shop A cost 

(10 x 18) + (2 x 30) + (2 x 35) = 310p. 
(iii) Jane’s requirements in shop B cost 

(10 x17) + @ x 32) + (2 x 33) = 300p. 
(iv) Joyce’s requirements in shop A cost 

(7 x 18) + (2 x 30) + (1 x 35) = 221p. 
(v) Jim’s requirements in shop A cost 

(15 x 18) + (4 x 30) + (3 x 35) = 495p. 
(vi) Jim’s requirements in shop B cost 

(15 x 17) + (4 x 32) + 3 x 33) = 482p. 


The completed table is in the main text, following Exercise 1.: 


As Fis og 
as =| 4B | 
pees = 
(1 x 3)+ (2 x —1) 


0 18 
SS Sy 
3. AB has the same number of rows as A, namely 3, and 


the same number of columns as B, namely 2, so AB is a 
3 x 2 matrix. 


sean 
(1 x 1) + (2 x 2) 


4. (i) B will have to have 4 rows to form AB (no 
restrictions on how many columns), so B will be a 4 x n 
matrix. 


(ii) B will have to have 2 columns to form BA (no 
restrictions on the number of rows), so B will be an m x 2 
matrix. . 


S. (i) AB, AC, BC, CA. (BA and CB can’t be formed.) 


(ii) AB isa 1 x 2 matrix, AC is a 1 x 1 matrix, BC is a 
2 x 1 matrix and CA is a 2 x 2 matrix. 


6. Possible matrix products are BC, BA, CB. 
(AC, CA and AB are not possible: ) 


BC = [1 -21_|- [5] 


ae oe. 
BA = [1 2; ! 


3] 4 6 
cB=|_t|u -2)=| | | 
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2 = FE 
ee ee ee ee 
4ST Ge 
Sy 
=| hb 6:8 
Se eee eee 
ea 64 
@ Me i 2 3 bi ee 
4A 4° 3 
ee ee 
S45. Gh 5 
SA es 


Note that AB and BA are not the same. 


8. It is sufficient in these proofs to show that the ijth 
element in the left-hand matrix is the same as the equivaJent 
one in the right-hand matrix. We will call such an element a 
‘typical’ element. 


(i) A typical element of A(B + C) is obtained by taking the 
ith row of A, 


[it a2 see Ain], 
and combining it with the jth column of B + C, 
b>; + C2; 
: , to obtain 
Dy Cy; 
Gj1(Dy 5 + C1;) + Gj2(b2; + C2j) Hoo + Ain(Dnj + Cnj) 


= i aix(byj + Cj) 
k=1 


= » ,D,;| + » auc) 
k k=1 


=1 
As these expressions are the typical elements of AB and AC, 


A(B + C) = AB + AC. 


(ii) A typical element of (B + C)A is formed by taking the 
ith row of B+ C, 


[bia + Cis bin + Ci2 Day Cy, | 


and combining it with the jth column of A, 


, to obtain 
ay, 
(Dix + Ci1)Qy; + (Diz + Ci2)a2; +°°° + (Din + Cin)Anj 


ss > (Dix + Cin) Oy; 
k=1 


= » bua) , cua) 
k=1 k=1 
As these are typical elements of BA and CA, 
(B+ C)A = BA + CA. 
(iii) This is the worst! 


To find a typical element of A(BC), we combine the ith row 
of A, 


[aiy eee = 
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with the jth column of BC, 


Bin Ck 


BakCr j 
k=1 


giving us the elements 


n 
"+ Ain > DakCkj 
k=1 


n n 
Git Y Dance; + G;2 yb ancey + 
k=1 Ri 
n 
au| 2: DakCkj 
eae ee 


n 
pa DigdgkCkj 
l1k=1 

n 


ez 3 qj gP ak rij 


k=1q=1 


2: | z abut Chj- 
k=1 \q=1 


I I 
sips Ms 


And this, believe it or not, is a typical element of (AB)C. 


9. 
ie 2 1 || 2 . 5 
ROG: fel 
10. 
3x, + Xz — 2x; = 2 
X, + 2x, + X3= 1 
4x, + X2 = —|] 
11. 
—3 1 X14 1 
be-=14)x5| = 24. 
3 nee —] 
A {= b 
12. pee. 
= C(Br,) = (CB)r, 
= CB(Ar, ) = (CBA)r, 
= Qr, where Q = CBA. 
* —] Lik3 1} 1 0 
= Tee a ee 
= —3 —5}}1 0 
oe ese Eo 44 
= —13 5 
ee ae St 
13. 
2 1 
arf jana = a 
| 2 
14. 


s|_[-3 0 
re ay 
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SO 
—3 0 
AB)’ = 1. 
oe OA 
12 0 -1. 
BTA? = 
5 3 7 


so for these matrices 
(AB)? = BTA’. 


(It is worth noting that A’B’ would be a 3 x 3 matrix, so 
there would be no hope that (AB)" equalled A7B".) 


15. (i) ab =| 


ee oe 2 1 —} 

(ii) AC = oe | 0 
-|"3 1 4 
oo 2 a 


(iii) AD can’t be formed. 
(iv) bC can’t be formed. 


is) C= 1 iS 


+ alll 


1 ; ; 
0 


qi 


=4 


2 2 
(vi) DC? = 5 ie - 3 
3 ~ 


16. From Exercise 7, 


i) 


No 


NO 


5 ee 
AB = 0 6 j and BA = 
6 
Also, 
2 0 1 1 
AC = 3 2 j | 0 
3 4 —-2] |-1 
1 0 0 
= -2 4 ) 
5 8 —7 
1 0 -1 
ca | 0 2 J 1 
—] 0 2 
—1 -4 3 
= E 4 2 
4 8 —§ 
1 2 -1 1 
ee 1 y ; | 0 
3 4 1} [-1 
2 4 —- 
ste - 4 i 
2 8 —I 
Thus 
(i) 5 8 —-] 1 
(AB)C = 0 6 8 0 
—7 6 7} (-1 
6 16 —-—7 
= —8 12 16], 
—14 12 21 
2 0 1 2 
A(BC) = - 2 1} |-4 
3 4 -2 2 
6 16 —7 
= | —§ 12 16 |, 
—14 12 Zt 


ao fk f 


so for the given matrices, 


(AB)C = A(BC). 


(ii) 
2 0 1 2 2 -—2 
A(B+ C)= |-1 2 1} /-1 4 3 
3 4 -2 Z 4 3 
6 8 —1 
= |-—2 10 Li. 
—2 14 
5 8 —-1 1 0 0 
AB + AC = 0 6 8} + |-—2 4 3 
—7 6 7 5 8 —7 
ae 8 —1 
= |—2 10 Se ae 
=s 14 0 
so for these matrices, 
A(B + C) = AB+AC. 
(iii) 
2 2 -—2 2 0 1 
(B+ C)A= |-1 4 3) |-1 2 1 
4 3 3 4 — 
—4 —4 8 
= 3 20 —3 |, 
9 20 0 
3 $6 35] fee 
BA + CA = a 16 —S| + |-2 4 
5 12 4 eae 
—-4 --4 
= 3 20 —3}), 
) 20 0 


so for these matrices, 
(B+C)A = BA + CA. 
17. (i) 


> 

ee) 

ll 
PS 
mem N 
© Ue 
| 
Kove sg 
eg Beane 


and 


SO 


Now, 


3 1 
2 3 ) 5 20 
ced & 0. _ ° = d 
0 
so the left-handed distributive law is true for these matrices. 


(il) 


2 1 5 6 12 

2 3 

BA = |0 3}| : AF 
~ ees ee 1 
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and 


aan | 
me NO 
oo w 
| 
ae) 
eel 
| 
N 
Nw 
co 4 


SO 


| 
ON 
+ 
~~ 
ON 
oe) 


7 9 19 
= 110 6 213 
18 27 63 
Now, 
3 1 
2 3 F 
(B+C)A=|2 6 5 ‘ | 
— | Zs 
a 9 19 
= |10 6 Zi. 
18 £2 63 


so the right-handed distributive law is true for these matrices. 


(iii) The associative law does not apply, as BC can’t be 
formed (the number of columns of B is not equal to the 
number of rows of C). 


18. 
zZ 3 4\ |x, —1 
1 —2 5 X23) = 0 
0 1 —2}) Lx; 10 
19. 


1. 
u 3 Z 1 
= - == aus 1, o = I, 
v 4 3 l 
2. 
x = cos 30° sin 30° |} 2 
yi j—sn3s@=  cos30° |, 3 
a ~3- * |\2 -| /3+3 
| =p ee 
SO 
3 
x = /3 +5 = 3.23, 
ee 
Fe ae 
Z 
RS 


He cos 45° ee 3=—2 
yt | sins" coedie ie 
1 1 1 10 
= a) ee me ec 
| 2° oe 
v2- v2- ve. 


MST204 20 Solutions / 59 


SO 
10 
x’ = —~ ~ 7.07, 
2 
8 
y =— ~ 5.66. 
We 
4. cos y sin y 0; {1 0 0 COS a SIN o& 0 
M,M,;M, = |-—siny cos 0; 10 cos B sin B — sin a COS a 0 
0 0 iy 40 —sin B cos B 0 0 1 
cos y sin y cos B sin y sin B COS sin a 0 
= |—siny cosy cos f cos y sin B —sin o COS a 0 
0 —sin B cos B 0 0 1 
cosy cosa — sinycos f sin a cosy sina + sinycos B cosa sin y sin B 
= — sin) Cos % — cosy cos f sin a —sinysina + cos ycos f cos « cosy sin B 
sin B sin « —sin B cosa cos fp 
(Alternatively, M,M,M, can be evaluated as M,(M,M,).) y =(-—siny cos « — cosy cos B sin «)(x — a) 
From Equation (6) on p. 30 we now obtain the equations + (—siny sina + cos 7 cos B cos a)(y — b) 
= (cosy cos « — sin y cos f sin «)(x — a) + (cos y sin B)(z — c), 
+ (cosy sina + siny cos B cos «)(y — b) z’ = (sin B sina)(x — a) — (sin B cos )(y — b) 
+ (sin y sin B)(z — c), © + (cos B)(z — c). 
Sr a ee 
Solutions to the exercises in Section 4 as : 
(ii) If A= 5 » p then a= 1,b = —1,c = —2, and 
1. is 
a= 2. Thus: 
MN = COS a sina |} cosa —sin a 
: Ex eo wa ad — be = (1 x 2)—(-1 x —2)=0, 
S cos*a+sin?a = —cosasina + sinacosa so the inverse can’t be formed. 
~ | —sinacos a + cos asin a sin? ~ + cos” « 0 1 
1 0 (iii) A=[' jf then a= 0, b= 1m 1,d=0, and 
> 


(Note that in this case, A is its own inverse). 


gk Sp St ge I a 
ea se s } 
| 7 64) 3 oF 6 
COS a Bese | COs a — 


sin & cos a |} —sina COS & 
2 cos” a + sin? « cos «sina — sinacos « 4. (i) 
~ | sinacos a — cosasina sin? a + cos? « Age I 
als a me! at 1 OTR, 
a A 2. Q2:< 1k; 
So MN =NM =I. F 1 | 1 ia 
2 R, — 5R,|0 > —% 1 JR2, 
Fe ee ree i Oe 
ad — be|c d jj —c a oe 1 =: 2\R 
SS ee oe Ss 
ee Fae ee R, — — 0 | 3 ag 
0 0 1 —3 3 |Ro 
, ‘|=! Ry, + . 0 | 3 Sy 
and 0 1 —i 21R 
oe . d - b 3 3 2b 
= —C C d I | ee 
da — db — bd 2 ae 
SAS : 
= be| —ca + ac —cb + ad = 2] 
fy ai: 
1 Check either that : A ' z a 
So AB = BA =I. ° : 
2 1 3 —4 
a = or sx |, 
3. 45) A= ; —pue the formula for A~* on p. 32 with 1 2 | -4 3 
a=2,b=7,c =1,d=6. This gives You don’t have to do both; remember that A and A7~! 


—— 1 6 = ] commute. 
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(ii) | Similarly, 
A | i (A~1A)? = AT(A~!)?. 
ba f- 8 UR But (AA~!)? and (A~!A)! are both I? = I. This shows that 
0 1 3 0 1 0 R, (A~1)? is the inverse of A’, which is what we want. 
3 0 1 0 0 1 R; 
1 3 0 1 0 0 R, 3 10. (i) 
Ct 4 Gi ee 
Bn, 1 5 1 | <4 0 1 R;, (A+ 1)(A —1)=A?+4+IA—-AI-FP 
=A?-] 
1 3 0 1 0 0 R, 
ee tee e458 -&.. and 
R;,+9R, 10 0 28 ee eg (A—I)(A+1D=A?-IA+AI-P 
a = A?-] 
to 3 os ee eae S 
eo ee ja ee, ee 
R3, + 28 [0 0 eee R3. (ii) From part (i) we have 
1 3 0 1 Oe es, (A+D(A-T=(A-D(A+D. 
= ee ee ee 
pee a a 5) Ro, Multiply both sides on the left, and also on the right, by 
fe 36 te. ae J Ra, (A +1". 
R, — 3R,, | ! ee a 0608s | Rin (A+T(A+DA-D(A+TD™ 
ee tw ws we] Ro =(A+I(A-1I(A+D(A+]T)?}; 
ee I se ts) tet Ra 
i | A : 
For safety, you should again check that the A~' you have WA —ID(A+T) + =(A+]) '(A-DI; 
found is correct by finding AA! or A” ‘A, and showing it 5 = 
ee eae a : a ee eA = 1. 
5, 
11. (i 
(A + B)(A — B) = A2 + BA — AB — B?. @) 
(Stop here, as AB + BA in general.) A | I 
; c 3 eek oe 2 
SAT ye ee 
(C-'B-'A~*)(ABC) 
= C”'B-'(A~!A)BC (using the associative law) 3 4 | 1 0] -R, 
=C'B-IBC (asA~'A =I) 3h 6 OS 
_cC-! -1p — 
= eee oo A has no inverse. 
=] ast C= ty 
or (ii) 
(ABC) = 4A(80)) '= (BC) A HO A A I 
- A. 1 -2 1 O| R, 
—1 e,° . 3 —1 0 1 R, 
As AA * =I, by definition of an inverse, A must be the 


inverse of A7}. 


(ii) A. f ee : ql R 
R, — 3R, [0 Se eee, 1} Ra, 
Let A’ = B and (A’)’ = C, and look at a typical element a;; : ; 


GLA. This becomes fhe clement a, 0) BR (as rows Delon = a ee 
columns). This in turn becomes the element a;; of C. 


1 -2 1 0; R, 
Thus (A’)? — A. R,, = 5 '@) 1 —2 = R>, 


8. 


on i i 0.| == 2] R 
(i+ Aj — AVE FE = Al -—- 1A = 2 R, + ae < ] ~ 
=J7+A-A-I (as A? = I). eae 5 2b 
=e i ose 
9. We know from Subsection 2.5 that (AB)’ = B’A’. Hence 
Letting B = A~', we have E 
Apts 
(AA~1)F = (A~2)TAT, eS 
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(iii) 
pee | I 
A 220 1 1 0 O1R, 
fo ee ee 
pe a. 6 ce 
pe 4 te: oe 
‘8, oe ti Se 
ry 23 . 0 oe 
1 0 1 1 0 O|}R, 
a af Sf ee 
Re. a O42 bei 
1 0 1 1 0 O7R, 
oe ee ee oi, 
R,, + 2 0 0 1 a 3R3, 
R,—R;, [1 0 0 3 + —71Ri, 
R,, + R3, | 0 1 es —4 5 > |Rop 
ee eee + 5 ae, 
I | Ao! 
Hence 
1 — | 
ge a ee ee 
1 -l 1 


12. We are given that 
B= Ax. 


Multiply both sides on the left by X and on the right by X71. 


AUS = 3M ARN =A. 


Thus, A = XBX~!. 
13. (i) 
A’B’ = (BAy (property of transpose) 
= (AB)! (as A and B commute) 
=f A" (property of transpose). 
So A’ and B’ commute. 
(ii) 
A~'B™! = (BA)“! (property of inverse) 
= (AB)"' (as A and B commute) 
=B™'A'' (property of inverse). 


-So A~' and B™* commute. 
(iii) 

AB" ' = (B"'B)AB"' (multiplying on the left by I) 
=B'(BA)B! (using the associative rule) 
= B'(AB)B ! ( 
= 8 'A(BB™*) ( 

=§ ‘A. 


as A and B commute) 
using the associative rule) 


So A and B“! commute. 


Solutions to the exercises in Section 5 


l. (i) 4 = (3 x 9= (1 * 4) 


= 
=15+4= 19. 
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(ii) I s|= a x2)-@ xan 


2. Using Equation (1): 


1 2 0 
2 3 —1 3 
oie ee ee 
ee ee | 
= (1 x (10 — 3)) — (2 x (—5 — 12)) 
=7+ 34= 41. 
(ii) 
1 -1 4 
Geta” =12 = 2 1 
0 3 5 
247 ae Hele Z 
Rh ee ee ee 
=(1 x 7)+ (1 x 10) + (4 x 6) 
=7+4+ 104+ 24= 41. 
3. 
1 2 3 
4 5 6 
7 8 9 
5 6 4 6 4 5 
ih ea es 8 
pe ey 
= (1 x —3)—-(2 x —6)+ (3 x —3) 
= —3+12-9=0, 
4. Using Equation (3): 
i j k 
axb=|3 —-1 4 
1 —-1 | 
sae 4; 13 tear | 
Stee i 
= 314+ j — 2k. 


5. The triangle OAB is the same as the triangle ABO, so all 
we have to do is to substitute the coordinates of O for those 
of C in Equation (6): 


Area of triangle ABO 


1 1 1 
= + modulus of | a, b, 0 
ar b, 0) 
b 0 
= + modulus ot} ia : se D1 
5b, 0 az 0 ar b, 
= + modulus of = ie 
. az b>| 
6. (i) Using Equation (9): 
1 0 1 
V = modulus of | 1 2 0 
| O55 43 
—s fiz is 
(ii) By part (i), a-(b x c) = 7. 
eee 6 ae det 
c:(a x b) = = — 
| 5 0 1 | 0 2 
=(=1 = =1).4+ (3-« 2) =7. 
1 2 0 
| 1 3 0 3 
b-(c x a) = |0 1 3 =) |= 2 : 
1 0 1 


=(1 x 1)-(2 x -3)=7. 
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Hence 


a:(b x c) =c-(a x b) =b-(c x a). 


b 

c:(b x a) = —e-(a x b) (property of cross product) 
= —A (using (11)). 

8. 


a-(b x a) =b-(a x a) (using (11)) 
= Q (property of cross product). 


9. If row 3 is a linear combination of rows 1 and 2, then 
c =k,a+k,b, and so 
A =a-(b x (k,;a + kyb)) 
=a-(b x (k,a)) + a-(b x (k,b)) 


= k,(a-(b x a)) + k,(a-(b x b)) (from P3) 

= () (from P2). 
10. @ 

, 4 Hee? ag 
Os ee 


= (1 x 26) — (2 x 22) + (7 x 10) 
= 26 — 44+ 70 = 52. 


(il) 1 2 ; 1 ) 5 
3 4 1|= 10 ee =) 
See 4 14| 
1 2 7 
ap ae? =a 
0 0 IG 
=1x —2x —26= 52. 
11. 

: : : ee : : baer napa 
2 2 4 ,) 7 4 Remember to use P1. 
ae 

= 0 2 oe 
4 ae 
1 3 6 
= —|0 72 3 
ee ee 
= —(1 x2 x -—2)=4. 
Z 1 
12. (@ = |=8+1=3 
(ii) 1 2 a8 
4 Nt 2 
2 —1 | 
Bae ee 
= | 5 = 
—| | > 1 =. a 
= (i x 3)—(2 x —5)— (1 x =1) 
S36 1 
= 14. 
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Ct ages eae | 2 3 f-4 2 3 
fi 3 eS a 2s 
ee eee BS), pee Se eee. 
ie 4 [8 6 2:4 po 

eee 2 3 
4 —— 
was ee 3 
= 0 3 
=| x} x —9x 3 

= —27, 


Solutions to the end of unit test 
Section A 


1. The answer is (b). Option (c) is wrong because it is a 
2 x 3 matrix. Remember, rows are specified before columns! 


2. The answer is (d). (Equate corresponding coefficients!) 


3. The answer is (d). (Only matrices of the same size can be 
added.) 


4. The answer is (b). (Add corresponding elements.) 
5. The answer is (a). (Multiply each element by 2.) 


6. The answer is (a). (It is the only 1 x 1 matrix among the 
options, for a start!) 


7. The answer is (c), being the only matrix among the 
options with 3 rows. 


8. The answer is (c). 
9. The answer is (e). This can be seen from the fact that the 
equations can be rewritten as 
3x, + 2x, +x3,;=4 
xX, —x,=0 
xX, + 2x, —x;=0. 


10. The answer is (c), as can be seen by multiplying both 
sides of Ax = b on the left by A ?. 


Section B 
1. (i) 
et oe fh 
K+1= 4.8 83-9 tt 6 
eae pee oo 
os 
31ft 3 
ee ae 
(il) 
X? + X = X(X +1) 
es SPSS 15 ae cae 
eo ee ee 
2 Ce De 
og 
=| 4 1 | 
2 eae 
(iii) 
X3 + X? = X(X? + X) 
es ee ee 
So ooh) ope 1 3 
> 3: ae 
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= 
A(I + A)? = A(I + 2A + A?) 
= A(I + 2A) (as A? = 0) 
= A+ 2A? 
= (as A? = 0). 
3. (i) 
(A+ B)—-(A+C)=A+B-—A-C=B-C. 
(ii) 
(A + B)? — A(A + 3B) — B(A — B) 
= A* + AB + BA + B? — A? — 3AB-— BA + B? 
= 2B’ — 2AB 
= 2(B? — AB) 
= 2(B — A)B. 
4. (i) 
L | i 
1 0 0 1 0 0 
ic | 0 0 1 0 
y Zz 1 0 0 1 
| 0 0 1 0 0 
0 1 0 —Xx 1 0 
0 z I —y 0 1 
1 0 0 1 0 0 
0 1 0 —x 1 0 
0 0 1 —y+xz —z 1 
I | Eo 
So the inverse is 
1 0 0 
—x 1 Ol: 
<P XZ er? 1 
(ii) 
Bx = Ax +c; 
(B — A)x = c. 
Now 
| 0 0 
B-A=|1 1 0}. 
1 1 1 


At this stage we can either use Gaussian elimination (giving 
us x; = 3, x, = 1, x3 = 1) or proceed as follows. 


(B — A)x =c; 
x = (B— A) “'e, 
and putting x = y = z = 1 in the result of part (i), 
1 0 0| |3 
x= |-1l 1 0; |4 
0 -1 ee SS 
3 
= $11. 
1 
5. (i) 
A*?-A+I=0; 
A-—A’=I; 
A(I — A) =I; 


so I — A is the inverse of A. 
(ii) 
(R-‘AR)? = (R~'AR)(R~'AR)(R7~'AR) 


= R~'A(RR~')A(RR~')AR_ (associative rule) 


=R™=‘*A*R = (as RR“! =I). 
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The general result is 


(R-'AR)"=R™‘'A’R___for any positive integer n. 


(Note, the inverse of an orthogonal matrix is its transpose.) 
(ii) 
A~' =A’ by part (i) 


(iii) 
(AB)(AB)’ = (AB)(B’A‘) (property of transpose) 
= A(BB’)A’ _ (associative rule) 
= AA! (because B is an orthogonal matrix) 
= | (because A is an orthogonal matrix). 


Thus AB is an orthogonal matrix. 


7. (i) x’ is 1 x n and Ax is n x 1. Thus, x7Ax is 1 x 1. 
(In practice, 1 x 1 matrices are usually thought of just as 
numbers. ) 


(il) 


bP: 
amt fe Gls 


=e = + S| 


cx, + dx, 
= [x,(ax, + bx.) + x2(cx,; + dx2)] 
= [ax? + (b + c)x,x2 + dx3]. 
(iii) Using part (ii) we must have a= 1,d = 1,b +c =2. 


There are many ways of choosing b and c; for example, 
b=c=1, giving 


| 


8. (i) y’Ax is a matrix with one element, so (y7Ax)’ is the 
same one-element matrix, i.e. 


(y7Ax)? = y7Ax. 


(ii) (y — Ax)"(y — Ax) 
= (y’ — x’A’)(y — Ax) (property of transpose) 
—_ y'y : y' Ax co xTATy + xTATAx 
= y’y — y’Ax — (y7Ax)’ + x7A7Ax 
(property of transpose) 
= y’y — y’Ax — y7Ax + x7A7Ax (from part (i)) 
= y’y — 2y"Ax + x7A7Ax. 


9. (i) s) = 4 « 8) — 1 x 2) =34 
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— 

m= ON a) 

l | 

cm x 

| foe) 

—” 

on | 

| a, 

I 

| 

= & x 

—_—_—_—_— yo 

om Nee 

II ll 
no «= 
| | 
oO = & 


ii) 


= 13 


—22 


17 


19 


N 
A ar 
Poe 
| 
CN ee eS 

pond 

| 
nx © © 

| 

eet OP ee ot 


Ix —-1x -17x —1 


= —129. 
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